PRODUCTION OF SYRINGYL LIGNIN IN GYMNOSPERMS
Field of the Invention 

This application claims the benefit of U.S. Provisional Application number 60/033,381, filed 
December 16, 1996. The invention relates to the molecular modification of gymnosperms in order to 
cause the production of syringyl units during lignin biosynthesis and to production and propagation of
gymnosperms containing syringyl lignin. 
Background of the Invention

Lignin is a major part of the supportive structure of most woody plants including angiosperm 
and gymnosperm trees which in turn are the principal sources of fiber for making paper and cellulosic
products. In order to liberate fibers from wood structure in a manner suitable for making many grades
of paper, it is necessary to remove much of the lignin from the fiber/lignin network. Lignin is removed 
from wood chips by treatment of the chips in an alkaline solution at elevated temperatures and pressure 
in an initial step of papermaking processes. The rate of removal of lignin from wood of different tree 
species varies depending upon lignin structure. Three different lignin structures have been identified in
trees: p-hydroxyphenyl, guaiacyl and syringyl, which are illustrated in Fig. 1.

Angiosperm species, such as Liquidambar styraciflua L. [sweetgum], have lignin composed of a 
mixture of guaiacyl and syringyl monomer units. In contrast, gymnosperm species such as Pinus taeda 
L. [loblolly pine] have lignin which is devoid of syringyl monomer units. Generally speaking, the rate 
of delignification in a pulping process is directly proportional to the amount of syringyl lignin present in
the wood. The higher delignification rates associated with species having a greater proportion of
syringyl lignin result in more efficient pulp mill operations since the mills make better use of energy
and capital investment and the environmental impact is lessened due to a decrease in chemicals used for 
delignification. 

It is therefore an object of the invention to provide gymnosperm species which are easier to 
delignify in pulping processes.

Another object of the invention is to provide gymnosperm species such as loblolly pine which 
contain syringyl lignin. 

An additional object of the invention is to provide a method for modifying genes involved in 
lignin biosynthesis in gymnosperm species so that production of syringyl lignin is increased while
production of guaiacyl lignin is suppressed.

Still another object of the invention is to produce whole gymnosperm plants containing genes 
which increase production of syringyl lignin and repress production of guaiacyl lignin. 

Yet another object of the invention is to identify, isolate and/or clone those genes in 
angiosperms responsible for production of syringyl lignin. 
A further object of the invention is to provide, in gymnosperms, genes which produce syringyl
lignin.

Another object of the invention is to provide a method for making an expression cassette 
insertable into a gymnosperm cell for the purpose of inducing formation of syringyl lignin in a
gymnosperm plant derived from the cell. 
Definitions 

The term "promoter" refers to a DNA sequence in the 5' flanking region of a given gene which 
is involved in recognition and binding of RNA polymerase and other transcriptional proteins and is 
required to initiate DNA transcription in cells. 

The term "constitutive promoter" refers to a promoter which activates transcription of a desired
gene, and is commonly used in creation of an expression cassette designed for preliminary experiments 
relative to testing of gene function. An example of a constitutive promoter is 35S CaMV, available 
from Clonetech. 

The term "expression cassette" refers to a double stranded DNA sequence which contains both
promoters and genes such that expression of a given gene is achieved upon insertion of the expression
cassette into a plant cell.

The term "plant" includes whole plants and portions of plants, including plant organs (e.g.
roots, stems, leaves, etc.).

The term "angiosperm" refers to plants which produce seeds encased in an ovary. A specific 
example of an angiosperm is Liquidambar styraciflua (L.)[sweetgum]. The angiosperm sweetgum 
produces syringyl lignin. 

The term "gymnosperm" refers to plants which produce naked seeds, that is, seeds which are 
not encased in an ovary. A specific example of a gymnosperm is Pinus taeda (L.) [loblolly pine]. The
gymnosperm loblolly pine does not produce syringyl lignin. 
Summary of the Invention 

With regard to the above and other objects, the invention provides a method for inducing
production of syringyl lignin in gymnosperms, and gymnosperms which contain syringyl lignin for
improved delignification in the production of pulp for papermaking and other applications. In
accordance with one of its aspects, the invention involves cloning an angiosperm DNA sequence which 
codes for enzymes involved in production of syringyl lignin monomer units, fusing the angiosperm 
DNA sequence to a lignin promoter region to form an expression cassette, and inserting the expression 
cassette into a gymnosperm genome. 

Enzymes required for production of syringyl lignin in an angiosperm are obtained by deducing 
an amino acid sequence of the enzyme, extrapolating an mRNA sequence from the amino acid 
sequence, constructing a probe for the corresponding DNA sequence and cloning the DNA sequence 
which codes for the desired enzyme. A promoter region specific to a gymnosperm lignin biosynthesis
gene is identified by constructing a probe for a gymnosperm lignin biosynthesis gene, sequencing the 5'
flanking region of the DNA which encodes the gymnosperm lignin biosynthesis gene to locate a 
promoter sequence, and then cloning that sequence. 
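
Extrapolating a nucleotide sequence from an amino acid sequence is complicated by codon degeneracy, so probe regions are usually chosen where few codons are possible. As a minimal sketch, assuming the standard genetic code and a hypothetical peptide (not taken from any sequence in this specification), the following Python functions score each window of a peptide by the number of DNA sequences that could encode it and pick the least degenerate window:

```python
# Number of codons that encode each amino acid in the standard genetic code.
CODON_COUNT = {
    "M": 1, "W": 1,
    "F": 2, "Y": 2, "H": 2, "Q": 2, "N": 2, "K": 2, "D": 2, "E": 2, "C": 2,
    "I": 3,
    "A": 4, "G": 4, "P": 4, "T": 4, "V": 4,
    "L": 6, "R": 6, "S": 6,
}


def degeneracy(peptide: str) -> int:
    """Count the distinct DNA sequences that could encode the peptide."""
    total = 1
    for amino_acid in peptide:
        total *= CODON_COUNT[amino_acid]
    return total


def least_degenerate_window(peptide: str, width: int):
    """Return (start, window, degeneracy) for the best probe window, or None."""
    best = None
    for start in range(len(peptide) - width + 1):
        window = peptide[start:start + width]
        score = degeneracy(window)
        if best is None or score < best[2]:
            best = (start, window, score)
    return best


if __name__ == "__main__":
    # "LSRMWFK" is a made-up peptide used only for illustration.
    print(least_degenerate_window("LSRMWFK", 3))
```

Windows rich in methionine and tryptophan score lowest because each has a single codon, which is why such stretches are often favored when a probe must be designed from protein sequence alone.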

An expression cassette is constructed by fusing the angiosperm syringyl lignin DNA sequence 
to the gymnosperm promoter DNA sequence. Alternatively, the angiosperm syringyl lignin DNA is
fused to a constitutive promoter to form an expression cassette. The expression cassette is inserted into
the gymnosperm genome to transform the gymnosperm genome. Cells containing the transformed 
genome are selected and used to produce a transformed gymnosperm plant containing syringyl lignin. 
In accordance with the invention, the angiosperm gene sequences bi-OMT, 4CL, FA5H-1 and
FA5H-2 have been determined and isolated as associated with production of syringyl lignin in
sweetgum and lignin promoter regions for the gymnosperm loblolly pine have been determined to be
the 5' flanking regions for the 4CL1B, 4CL3B and PAL gymnosperm lignin genes. Expression
cassettes containing sequences of selected genes from sweetgum have been inserted into loblolly pine
embryogenic cells and presence of sweetgum genes associated with production of syringyl lignin has
been confirmed in daughter cells of the resulting loblolly pine embryogenic cells.

The invention therefore enables production of gymnosperms such as loblolly pine containing 
genes which code for production of syringyl lignin, to thereby produce in such species syringyl lignin in 
the wood structure for enhanced pulpability. 
Brief Description of the Drawings 
The above and other aspects of the invention will now be further described in the following
detailed specification considered in conjunction with the following drawings in which:

Fig. 1 illustrates a generalized pathway for lignin synthesis;

Fig. 2 illustrates a bifunctional-O-methyl transferase (bi-OMT) gene sequence involved in the
production of syringyl lignin in an angiosperm (SEQ ID 3);

Fig. 3 illustrates a 4-coumarate CoA ligase (4CL) gene sequence involved in the production of
syringyl lignin in an angiosperm (SEQ ID 4);

Fig. 4 illustrates a ferulic acid-5-hydroxylase (FA5H-1) gene sequence involved in the 
production of syringyl lignin in an angiosperm (SEQ ID 1); 

Fig. 5 illustrates a ferulic acid-5-hydroxylase (FA5H-2) gene sequence involved in the 
production of syringyl lignin in an angiosperm (SEQ ID 2);

Fig. 6 illustrates nucleotide sequences of the 5' flanking region of the loblolly pine 4CL1B gene 
showing the location of regulatory elements for lignin biosynthesis (SEQ ID 6); 

Fig. 7 illustrates nucleotide sequences of the 5' flanking region of the loblolly pine 4CL3B gene 
showing the location of regulatory elements for lignin biosynthesis (SEQ ID 7); 
Fig. 8 illustrates nucleotide sequences of the 5' flanking region of the loblolly pine PAL gene
showing the location of regulatory elements for lignin biosynthesis (SEQ ID 5);

Fig. 9 illustrates a PCR confirmation of the sweetgum FA5H-1 gene sequence in transgenic 
loblolly pine cells; and 

Detailed Description of the Invention 
In accordance with the invention, a method is provided for modifying a gymnosperm genome,
such as the genome of a loblolly pine, so that syringyl lignin will be produced in the resulting plant,
thereby enabling cellulosic fibers of the same to be more easily separated from lignin in a pulping
process. In general, this is accomplished by fusing one or more angiosperm DNA sequences (referred
to at times herein as the "ASL DNA sequences") which are involved in production of syringyl lignin to
a gymnosperm lignin promoter region (referred to at times herein as the "GL promoter region") specific
to genes involved in gymnosperm lignin biosynthesis to form a gymnosperm syringyl lignin expression 
cassette (referred to at times herein as the "GSL expression cassette"). Alternatively, the one or more 
ASL DNA sequences are fused to one or more constitutive promoters to form a GSL expression 
cassette. 

The GSL expression cassette preferably also includes selectable marker genes which enable
transformed cells to be differentiated from untransformed cells. The GSL expression cassette
containing selectable marker genes is inserted into the gymnosperm genome and transformed cells are 
identified and selected, from which whole gymnosperm plants may be produced which exhibit 
production of syringyl lignin. 
To suppress production of less preferred forms of lignin in gymnosperms, such as guaiacyl
lignin, genes from the gymnosperm associated with production of these less preferred forms of lignin
are identified and isolated, and the DNA sequence coding for anti-sense mRNA (referred to at times herein
as the "GL anti-sense sequence") for these genes is produced. The DNA sequence coding for anti-sense
mRNA is then incorporated into the gymnosperm genome; when expressed, the anti-sense mRNA binds to the less
preferred guaiacyl gymnosperm lignin mRNA, inactivating it.

Further features of these and various other steps and procedures associated with practice of the 
invention will now be described in more detail beginning with identification and isolation of ASL DNA 
sequences of interest for use in inducing production of syringyl lignin in a gymnosperm. 
I. Determination of DNA Sequence for Genes Associated with Production of Syringyl Lignin

The general biosynthetic pathway for production of lignin has been postulated as shown in Fig. 
1. From Fig. 1, it can be seen that the genes CCL, OMT and F5H (which is from the class of P450 
genes) may play key roles in production of syringyl lignin in some plant species, but their specific 
contributions and mechanisms remain to be positively established. It is suspected that the CCL, OMT 
and F5H genes may have specific equivalents in a specific angiosperm, such as sweetgum. 
Accordingly, one aim of the present invention is to identify, sequence and clone specific genes of
interest from an angiosperm such as sweetgum which are involved in production of syringyl lignin and
to then introduce those genes into the genome of a gymnosperm, such as loblolly pine, to induce 
production of syringyl lignin. 

Genes of interest may be identified in various ways, depending on how much information about 
the gene is already known. Genes believed to be associated with production of syringyl lignin have
already been sequenced from a few angiosperm species, viz, CCL and OMT. 

DNA sequences of the various CCL and OMT genes are compared to each other to determine if 
there are conserved regions. Once the conserved regions of the DNA sequences are identified, oligo-dT
primers homologous to the conserved sequences are synthesized. Reverse transcription of the
DNA-free total RNA which was purified from sweetgum xylem tissue, followed by double PCR using
gene-specific primers, enables production of probes for the CCL and OMT genes.
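
The comparison step described above, in which known sequences are inspected for conserved regions that can serve as primer sites, can be sketched as follows. This is only an illustration: it assumes the sequences have already been aligned to equal length by some other tool, and the three short sequences are invented rather than taken from any CCL or OMT gene.

```python
def conserved_blocks(aligned, min_len=4):
    """Return (start, block) for each run of identical columns >= min_len long.

    `aligned` is a list of pre-aligned, equal-length sequences.
    """
    length = len(aligned[0])
    if any(len(seq) != length for seq in aligned):
        raise ValueError("sequences must be pre-aligned to equal length")

    blocks = []
    run_start = None
    # Iterate one position past the end so a run touching the end is closed.
    for i in range(length + 1):
        column_identical = i < length and len({seq[i] for seq in aligned}) == 1
        if column_identical and run_start is None:
            run_start = i
        elif not column_identical and run_start is not None:
            if i - run_start >= min_len:
                blocks.append((run_start, aligned[0][run_start:i]))
            run_start = None
    return blocks


if __name__ == "__main__":
    # Hypothetical aligned fragments from three species.
    toy_alignment = [
        "ATGGCCTTAGGA",
        "ATGGCATTAGGT",
        "ATGGCGTTAGGC",
    ]
    print(conserved_blocks(toy_alignment))
```

In practice a primer site needs many more conserved bases than the four used here; the threshold is kept small only so the toy alignment produces output.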

A sweetgum cDNA library is constructed in a host, such as lambda ZAPII, available from
Stratagene, of La Jolla, CA, using poly(A)+ RNA isolated from sweetgum xylem, according to the
methods described by Bugos et al. (1995 Biotechniques 19:734-737). The above mentioned probes are
used to assay the sweetgum cDNA library to locate cDNA which codes for enzymes involved in
production of syringyl lignin. Once a syringyl lignin sequence is located, it is then cloned and
sequenced according to known methods which are familiar to those of ordinary skill.

In accordance with the invention, two sweetgum syringyl lignin genes have been determined 
using the above-described technique. These genes have been designated 4CL and bi-OMT. The
sequence obtained for the sweetgum syringyl lignin gene, designated bi-OMT, is illustrated in Fig. 2
(SEQ ID 3). The sequence obtained for the sweetgum syringyl lignin gene, designated 4CL, is 
illustrated in Fig. 3 (SEQ ID 4). 

An alternative procedure was employed to identify the F5H equivalent genes in sweetgum. 
Because the DNA sequences for similar P450 genes from other plant species were known, probes for
the P450 genes were designed based on the conserved regions found by comparing the known
sequences for similar P450 genes. The known P450 sequences used for comparison include all plant
P450 genes in the GenBank database. Primers were designed based on two highly conserved regions
which are common to all known plant P450 genes. The primers were then used in a PCR reaction with
the sweetgum cDNA library as a template. Once P450-like fragments were located, they were
amplified using standard PCR techniques, cloned into a pBluescript vector available from Clonetech of
Palo Alto, CA and transformed into a DH5α E. coli strain available from Gibco BRL of Gaithersburg,
MD.

After E. coli colonies were tested in order to determine that they contained the P450-like DNA 
fragments, the fragments were sequenced. Several P450-like sequences were located in sweetgum 
using the above-described technique. One P450-like sequence was sufficiently different from other
known P450 sequences to indicate that it represented a new P450 gene family. This potentially new
P450 cDNA fragment was used as a probe to screen a full length clone from the sweetgum xylem
library. This putative hydroxylase clone was designated FA5H-1. The sequence obtained for FA5H-1
is illustrated in Fig. 4 (SEQ ID 1). 
II. Identification of GL Gene Promoter Regions

In order to locate gymnosperm lignin promoter regions, probes are developed to locate lignin 
genes. After the gymnosperm lignin gene is located, the portion of DNA upstream from the gene is
sequenced, preferably using the GenomeWalker Kit, available from Clonetech. The portion of DNA 
upstream from the lignin gene will generally contain the gymnosperm lignin promoter region. 

Gymnosperm genes of interest include CCL-like genes and PAL-like genes, which are believed
to be involved in the production of lignin in gymnosperms. Preferred probe sequences are developed
based on previously sequenced genes, which are available from GenBank. The preferred GenBank
accession numbers for the CCL-like genes include U39404 and U39405. A preferred GenBank
accession number for a PAL-like gene is U39792. Probes for such genes are constructed according to
methods familiar to those of ordinary skill in the art. A genomic DNA library is constructed and DNA
fragments which code for gymnosperm lignin genes are then identified using the above mentioned
probes. A preferred DNA library is obtained from the gymnosperm Pinus taeda (L.) [loblolly pine],
and a preferred host of the genomic library is Lambda DASH II, available from Stratagene of La Jolla,
CA.

Once the DNA fragments which code for the gymnosperm lignin genes are located, the
genomic region upstream from the gymnosperm lignin gene (the 5' flanking region) is identified. This
region contains the GL promoter. Three promoter regions were located from gymnosperm lignin
biosynthesis genes. The first is the 5' flanking region of the loblolly pine 4CL1B gene, shown in Fig. 6
(SEQ ID 6). The second is the 5' flanking region of the loblolly pine gene 4CL3B, shown in Fig. 7
(SEQ ID 7). The third is the 5' flanking region of the loblolly pine gene PAL, shown in Fig. 8 (SEQ
ID 5).

III. Fusing the GL Promoter Region to the ASL DNA Sequence

The next step of the process is to fuse the GL promoter region to the ASL DNA sequence to 
make a GSL expression cassette for insertion into the genome of a gymnosperm. This may be 
accomplished by standard techniques. In a preferred method, the GL promoter region is first cloned 
into a suitable vector. Preferred vectors are pGEM7Z, available from Promega, Madison, WI and SK 
available from Stratagene, of La Jolla, CA. After the promoter sequence is cloned into the vector, it is
then released with suitable restriction enzymes. The ASL DNA sequence is released with the same 
restriction enzyme(s) and purified. 

The GL promoter region sequence and the ASL DNA sequence are then ligated, such as with T4
DNA ligase, available from Promega, to form the GSL expression cassette. Fusion of the GL and ASL
DNA sequence is confirmed by restriction enzyme digestion and DNA sequencing. After confirmation 
of GL promoter-ASL DNA fusion, the GSL expression cassette is released from the original vector 
with suitable restriction enzymes and used in construction of vectors for plant transformation. 
IV. Fusing the ASL DNA Sequence to a Constitutive Promoter Region 

In an alternative embodiment, a standard constitutive promoter may be fused with the ASL 
DNA sequence to make a GSL expression cassette. For example, a standard constitutive promoter may 
be fused with FA5H-1 to form an expression cassette for insertion of FA5H-1 sequences into a 
gymnosperm genome. In addition, a standard constitutive promoter may be fused with FA5H-2 to form 
an expression cassette for insertion of FA5H-2 into a gymnosperm genome. A constitutive promoter 
for use in the invention is the double 35S promoter, available from Clonetech. 

In the preferred practice of the invention using constitutive promoters, a suitable vector such as
pBi221 is digested with XbaI and HindIII to release the 35S promoter. At the same time the vector pHygro,
available from International Paper, is digested with XbaI and HindIII to release the double 35S
promoter. The double 35S promoter is ligated to the previously digested pBi221 vector to produce a
new pBi221 with the double 35S promoter. This new pBi221 is digested with SacI and SmaI to
release the GUS fragment. The vector is next treated with T4 DNA polymerase to produce blunt ends
and the vector is self-ligated. This vector is then further digested with BamHI and XbaI, available from
Promega. After the pBi221 vector containing the constitutive promoter region has been prepared,
lignin gene sequences are prepared for insertion into the pBi221 vector.

The coding regions of sweetgum FA5H-1 or FA5H-2 are amplified by PCR using primers with
restriction sites incorporated in the 5' and 3' ends. In one example, an XbaI site was incorporated at
the 5' end and a BamHI site was incorporated at the 3' end of the sweetgum FA5H-1 or FA5H-2 genes.
After PCR, the FA5H-1 and FA5H-2 genes were separately cloned into a TA vector available from
Invitrogen. The TA vectors containing the FA5H-1 and FA5H-2 genes, respectively, were digested by
XbaI and BamHI to release the FA5H-1 or FA5H-2 sequences.
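
Incorporating restriction sites at the ends of a coding region amounts to adding the recognition sequence to the 5' end of each primer. The sketch below illustrates the idea using the standard recognition sequences for XbaI (TCTAGA) and BamHI (GGATCC). The coding sequence, the 18-base annealing length and the two-base "GC" clamp ahead of each site are assumptions chosen for illustration; they are not the primers actually used for FA5H-1 or FA5H-2.

```python
XBAI_SITE = "TCTAGA"   # XbaI recognition sequence
BAMHI_SITE = "GGATCC"  # BamHI recognition sequence
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of an upper-case DNA string."""
    return dna.translate(_COMPLEMENT)[::-1]


def site_positions(seq: str, site: str):
    """Return every 0-based position at which `site` occurs in `seq`."""
    positions = []
    index = seq.find(site)
    while index != -1:
        positions.append(index)
        index = seq.find(site, index + 1)
    return positions


def tailed_primers(coding_region: str, anneal_len: int = 18, clamp: str = "GC"):
    """Build a forward primer with an XbaI tail and a reverse primer with a BamHI tail.

    Raises ValueError if either site already occurs inside the coding region,
    because a later digest would then cut the insert itself.
    """
    for site in (XBAI_SITE, BAMHI_SITE):
        if site_positions(coding_region, site):
            raise ValueError(f"internal {site} site would be cut during release")
    forward = clamp + XBAI_SITE + coding_region[:anneal_len]
    reverse = clamp + BAMHI_SITE + reverse_complement(coding_region[-anneal_len:])
    return forward, reverse


if __name__ == "__main__":
    # Hypothetical coding region used only to exercise the functions.
    toy_cds = "ATGGCTAGCAAAGGAGAAGAACTTTTCACTGGAGTTGTCCCAATTCTTGTTGAATTAGATTAA"
    fwd, rev = tailed_primers(toy_cds)
    print(fwd)
    print(rev)
```

Checking for internal sites before choosing the enzyme pair matters because the same enzymes are later used to release the insert from the TA vector.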

The p35SS vector, described above, and the isolated sweetgum FA5H-1 or FA5H-2 fragments 
were then ligated to make GSL expression cassettes containing the constitutive promoter.
V. Inserting the Expression Cassette into the Gymnosperm Genome
There are a number of methods by which the GSL expression cassette may be inserted into a
target gymnosperm cell. One method of inserting the expression cassette into the gymnosperm is by
micro-projectile bombardment of gymnosperm cells. For example, embryogenic tissue cultures of
loblolly pine may be initiated from immature zygotic embryos. Tissue is maintained in an
undifferentiated state on semi-solid proliferation medium. For transformation, embryogenic tissue is
suspended in liquid proliferation medium. Cells are then sieved through a screen, preferably 40 mesh,
to separate small, densely cytoplasmic cells from large vacuolar cells.

After separation, a portion of the liquid cell suspension fraction is vacuum deposited onto filter 
paper and placed on semi-solid proliferation medium. The prepared gymnosperm target cells are then 
grown for several days on filter paper discs in a petri dish.

A 1:1 mixture of plasmid DNA containing the selectable marker expression cassette and
plasmid DNA containing the FA5H-1 expression cassette may be precipitated with gold to form
microprojectiles. The microprojectiles are rinsed in absolute ethanol and aliquots are dried onto a
suitable macrocarrier such as the macrocarrier available from BioRad in Hercules, CA.

Prior to bombardment, embryogenic tissue is preferably desiccated under a sterile laminar-flow 
hood. The desiccated tissue is transferred to semi-solid proliferation medium. The prepared 
microprojectiles are accelerated from the macrocarrier into the desiccated target cells using a suitable 
apparatus such as a BioRad PDS-1000/He particle gun. In a preferred method, each plate is
bombarded once, rotated 180 degrees, and bombarded a second time. Preferred bombardment 
parameters are 1350 psi rupture disc pressure, 6 mm distance from the rupture disc to macrocarrier 
(gap distance), 1 cm macrocarrier travel distance, and 10 cm distance from macrocarrier stopping 
screen to culture plate (microcarrier travel distance). Tissue is then transferred to semi-solid 
proliferation medium containing a selection agent, such as hygromycin B, for two days after 
bombardment. 

Other methods of inserting the GSL expression cassette include use of silicon carbide whiskers, 
transformed protoplasts, Agrobacterium vectors and electroporation. 
VI. Identifying Transformed Cells

In general, insertion of the GSL expression cassette will typically be carried out in a mass of 
cells and it will be necessary to determine which cells harbor the recombinant DNA molecule 
containing the GSL expression cassette. Transformed cells are first identified by their ability to grow 
vigorously on a medium containing an antibiotic which is toxic to non-transformed cells. Preferred 
antibiotics are kanamycin and hygromycin B. Cells which grow vigorously on antibiotic-containing
medium are further tested for presence of portions of the plasmid vector, of the syringyl lignin
genes in the GSL expression cassette (e.g. the angiosperm bi-OMT, 4CL, FA5H-1 or FA5H-2 gene),
or of other fragments in the GSL expression cassette. Specific methods which
can be used to test for presence of portions of the GSL expression cassette include Southern blotting 
with a labeled complementary probe or PCR amplification with specific complementary primers. In yet 
another approach, an expressed syringyl lignin enzyme can be detected by Western blotting with a 
specific antibody, or by assaying for a functional property such as the appearance of functional 
enzymatic activity. 
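
The logic of a PCR test for the inserted gene can be sketched in a few lines: a product of the expected size appears only if both primer sites are present on the template in the correct orientation. The sketch assumes exact primer matches on a linear template and uses invented sequences; real primers tolerate some mismatches, and a gel result would still need the controls customary for such an assay.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of an upper-case DNA string."""
    return dna.translate(_COMPLEMENT)[::-1]


def amplicon_length(template: str, forward: str, reverse: str):
    """Return the expected product length in bp, or None if no product forms.

    `forward` must match the given (top) strand and `reverse` must match the
    opposite strand downstream of the forward primer site.
    """
    start = template.find(forward)
    if start == -1:
        return None
    reverse_site = reverse_complement(reverse)
    end = template.find(reverse_site, start + len(forward))
    if end == -1:
        return None
    return end + len(reverse_site) - start


if __name__ == "__main__":
    # Hypothetical transgene fragment flanked by host sequence.
    transformed = "CCCC" + "ATGGCTAGC" + "T" * 10 + "GAATTAGAT" + "GGGG"
    untransformed = "CCCCTTTTGGGG"
    fwd = "ATGGCTAGC"
    rev = reverse_complement("GAATTAGAT")
    print(amplicon_length(transformed, fwd, rev))
    print(amplicon_length(untransformed, fwd, rev))
```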



VII. Production of a Gymnosperm Plant from the Transformed Gymnosperm Cell

Once transformed embryogenic cells of the gymnosperm have been identified, isolated and
multiplied, they may be grown into plants. It is expected that all plants resulting from transformed cells
will contain the GSL expression cassette in all their cells, and that wood in the secondary growth stage
of the mature plant will be characterized by the presence of syringyl lignin.

Transgenic embryogenic cells are allowed to replicate and develop into somatic embryos,
which are then converted into somatic seedlings.

VIII. Identification, Production and Insertion of a GL mRNA Anti-Sense Sequence

In addition to adding ASL DNA sequences, anti-sense sequences may be incorporated into a
gymnosperm genome, via GSL expression cassettes, in order to suppress formation of the less preferred
native gymnosperm lignin. To this end, the gymnosperm lignin gene is first located and sequenced in
order to determine its nucleotide sequence. Methods for locating and sequencing amino acids which
have been previously discussed may be employed. For example, if the gymnosperm lignin gene has
already been purified, standard sequencing methods may be employed to determine the DNA nucleic
acid sequence.

If the gymnosperm lignin gene has not been purified and functionally similar DNA or mRNA 
sequences from similar species are known, those sequences may be compared to identify highly 
conserved regions and this information used as a basis for the construction of a probe. A gymnosperm 
cDNA or genomic library can be probed with the above mentioned sequences to locate the gymnosperm
lignin cDNA or genomic DNA. Once the gymnosperm lignin DNA is located, it may be sequenced
using standard sequencing methods. 

After the DNA sequence has been obtained for a gymnosperm lignin sequence, the 
complementary anti-sense strand is constructed and incorporated into an expression cassette. For 
example, the GL mRNA anti-sense sequence may be fused to a promoter region to form an expression
cassette as described above. In a preferred method, the GL mRNA anti-sense sequence is incorporated
into the previously discussed GSL expression cassette which is inserted into the gymnosperm genome as 
described above. 
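
Constructing the complementary anti-sense strand is a reverse-complement operation on the coding sequence. The following sketch shows the relationship between a coding strand, its mRNA and the anti-sense RNA that would pair with that mRNA. The six-base sequence is a placeholder, not part of any gymnosperm lignin gene.

```python
_DNA_COMPLEMENT = str.maketrans("ACGT", "TGCA")
_RNA_PAIR = {"A": "U", "U": "A", "G": "C", "C": "G"}


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of a DNA string, written 5' to 3'."""
    return dna.upper().translate(_DNA_COMPLEMENT)[::-1]


def mrna(coding_strand: str) -> str:
    """mRNA has the same sequence as the coding strand, with U in place of T."""
    return coding_strand.upper().replace("T", "U")


def antisense_rna(coding_strand: str) -> str:
    """RNA complementary to the mRNA of `coding_strand`, written 5' to 3'."""
    return reverse_complement(coding_strand).replace("T", "U")


def pairs_fully(sense: str, antisense: str) -> bool:
    """True if the two RNAs form an uninterrupted antiparallel duplex."""
    return len(sense) == len(antisense) and all(
        _RNA_PAIR[a] == b for a, b in zip(sense, reversed(antisense))
    )


if __name__ == "__main__":
    coding = "ATGGCC"  # placeholder sequence
    print(mrna(coding), antisense_rna(coding))
    print(pairs_fully(mrna(coding), antisense_rna(coding)))
```

In a cassette, the same effect is usually obtained by placing the gene fragment behind the promoter in reverse orientation, so that transcription yields the anti-sense RNA directly.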

IX. Inclusion of Cytochrome P450 Reductase (CPR) to Enhance Biosynthesis of Syringyl Lignin in
Gymnosperms

In the absence of external cofactors such as NADPH (an electron donor in reductive
biosyntheses), certain angiosperm lignin genes such as the FA5H genes may remain inactive or not
achieve full or desired activity after insertion into the genome of a gymnosperm. Inactivity or
insufficient activity can be determined by testing the resulting plant which contains the FA5H genes for
the presence of syringyl lignin in secondary growth. It is known that cytochrome P450 reductase (CPR)
may be involved in promoting certain reductive biochemical reactions, and may activate the desired
expression of genes in many plants. Accordingly, if it is desired to enhance the expression of the
angiosperm syringyl lignin genes in the gymnosperm, CPR may be inserted in the gymnosperm
genome. In order to express CPR, the DNA sequence of the enzyme is ligated to a constitutive
promoter or, for a specific species such as loblolly pine, xylem-specific lignin promoters such as PAL,
4CL1B or 4CL3B to form an expression cassette. The expression cassette may then be inserted into the 
gymnosperm genome by various methods as described above. 
X. Examples

The following non-limiting examples illustrate further aspects of the invention. In these 
examples, the angiosperm is Liquidambar styraciflua (L.)[sweetgum] and the gymnosperm is Pinus 
taeda (L.)[loblolly pine]. The nomenclature for the genes referred to in the examples is as follows:

Gene                     Biochemical Name
4CL (angiosperm)         4-coumarate CoA ligase
bi-OMT (angiosperm)      bifunctional-O-methyl transferase
FA5H-1 (angiosperm)      ferulic acid-5-hydroxylase
FA5H-2 (angiosperm)      ferulic acid-5-hydroxylase
PAL (gymnosperm)         phenylalanine ammonia-lyase
4CL1B (gymnosperm)       4-coumarate CoA ligase
4CL3B (gymnosperm)       4-coumarate CoA ligase

Example 1 - Isolating and Sequencing bi-OMT and 4CL Genes from an Angiosperm 

A cDNA library for sweetgum was constructed in Lambda ZAPII, available from Stratagene,
of La Jolla, CA, using poly(A)+ RNA isolated from sweetgum xylem tissue. Probes for bi-OMT and
4CL were obtained through reverse transcription of their mRNAs followed by double PCR using
gene-specific primers which were designed based on the OMT and CCL cDNA sequences obtained
from similar genes cloned from other species.

Three primers were used for amplifying OMT fragments. One was an oligo-dT primer, bi-OMT,
(which was cloned through modified differential display technique, as described below in
Example 2) and the other two were degenerate primers, which were based on the conserved sequences
of all known OMTs. The two degenerate primers were derived based on the following amino acid
sequences:

5'-Gly Gly Met Ala Thr Tyr Cys Cys Ala Thr Thr Tyr Ala Ala Cys Ala Ala Gly Gly Cys-3'
(primer #22) and

3'-Ala Ala Ala Gly Ala Gly Ala Gly Asn Ala Cys Asn Asn Ala Asn Asn Ala Asn Gly Ala-5'
(primer #23).

A 900 bp PCR product was produced when oligo-dT primer and primer #22 were used, and a
550 bp fragment was produced when primers #22 and #23 were used.
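
A degenerate primer is in practice a pool of related oligonucleotides, one for each combination of bases allowed at the ambiguous positions. As a hedged illustration, the sketch below expands a primer written with the standard IUPAC nucleotide ambiguity codes into its concrete members. The six-base primer in the example is invented and is not primer #22 or #23.

```python
from itertools import product

# Standard IUPAC nucleotide ambiguity codes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}


def expand(primer: str):
    """List every concrete sequence represented by a degenerate primer."""
    choices = [IUPAC[base] for base in primer.upper()]
    return ["".join(combo) for combo in product(*choices)]


def pool_size(primer: str) -> int:
    """Number of distinct oligonucleotides in the degenerate pool."""
    size = 1
    for base in primer.upper():
        size *= len(IUPAC[base])
    return size


if __name__ == "__main__":
    # "GAYAAR" is a hypothetical primer with two two-fold ambiguous positions.
    print(expand("GAYAAR"))
    print(pool_size("GAYAAR"))
```

The pool size grows multiplicatively with each ambiguous position, which is one reason degenerate primers are placed over the most highly conserved stretches available.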

Three primers were used for amplifying CCL fragments. They were derived from the
following amino acid sequences:

5'-Thr Thr Gly Gly Ala Thr Cys Cys Gly Gly Ile Ala Cys Ile Ala Cys Ile Gly Gly Ile Tyr Thr
Ile Cys Cys Ile Ala Ala Arg Gly Gly-3' (primer R1S)

5'-Thr Thr Gly Gly Ala Thr Cys Cys Gly Thr Ile Gly Thr Ile Gly Cys Ile Cys Ala Arg Cys
Ala Arg Gly Thr Ile Gly Ala Tyr Gly Gly-3' (primer H1S) and

3'-Cys Cys Ile Cys Thr Tyr Thr Ala Asp Ala Cys Arg Thr Ala Asp Gly Cys Ile Cys Cys Ala
Gly Cys Thr Gly Thr Ala-5' (primer R2A)

R1S and H1S were both sense primers. Primer R2A was an anti-sense primer. A 650 bp
fragment was produced if R1S and R2A primers were used and a 550 bp fragment was produced when
primers H1S and R2A were used. The sequences of these three primers were derived from conserved
sequences for plant CCLs.

The reverse transcription-double PCR cloning technique used for these examples consisted of
adding 10 μg of DNA-free total RNA in 25 μl DEPC-treated water to a microfuge tube. Next, the
following solutions were added:

a. 5x Reverse transcript buffer 8.0 μl
b. 0.1 μM DTT 4.0 μl
c. 10 mM dNTP 2.0 μl
d. 100 μM oligo-dT primers 8.0 μl
e. RNasin 2.0 μl
f. SuperScript II 1.0 μl

After mixing, the tube was incubated at a temperature of 42° C for one (1) hour, followed by
incubation at 70° C for fifteen (15) minutes. Forty (40) μl of 1N NaOH was added and the tube was
further incubated at 68° C for twenty (20) minutes. After the incubation periods, 80 μl of 1N HCl was
added to the reaction mixture. At the same time, 17 μl NaOAc, 5 μl glycogen and 768 μl of 100%
ethanol were added and the reaction mixture was maintained at -80° C for 15 minutes in order to
precipitate the cDNA. The precipitated cDNA was centrifuged at high speed at 4° C for 15 minutes.
The resulting pellet was washed with 70% ethanol and then dried at room temperature, and then was
dissolved in 20 μl of water.

The foregoing procedure produced purified cDNA which was used as a template to carry out
first round PCR using primer #22 and oligo-dT for cloning OMT cDNA and primers R1S and R2A for
cloning 4CL cDNA. For the first round PCR, a master mix of 50 μl for each reaction was prepared.
Each 50 μl mixture contained:

a. 10x buffer 5 μl

b. 25 mM MgCl2 5 μl

c. 100 μM sense primer 1 μl (primer #22 for OMT and primer R1S for CCL)

d. 100 μM anti-sense primer 1 μl (oligo-dT primer for OMT and R2A for CCL)

e. 10 mM dNTP 1 μl

f. Taq DNA polymerase 0.5 μl

Of this master mix, 48 μl was added into a PCR tube containing 2 μl of cDNA for PCR. The
tube was heated to 95° C for 45 seconds, 52° C for one minute and 72° C for two minutes. This
temperature cycle was repeated for 40 cycles and the mixture was then held at 72° C for 10 minutes.
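
The first-round PCR set-up above can be summarized numerically. The sketch below assumes that the balance of each 50 microliter master mix is water (the listed components total less than 50 microliters and the text does not name the remainder) and ignores ramp times between temperatures; the function name is hypothetical.

```python
# First-round PCR master mix, microliters per 50 ul reaction, as listed above.
MASTER_MIX_UL = {
    "10x buffer": 5.0,
    "25 mM MgCl2": 5.0,
    "100 uM sense primer": 1.0,
    "100 uM anti-sense primer": 1.0,
    "10 mM dNTP": 1.0,
    "Taq DNA polymerase": 0.5,
}
MIX_VOLUME_UL = 50.0

# Assumption: whatever is not listed is water.
water_ul = MIX_VOLUME_UL - sum(MASTER_MIX_UL.values())

# Cycling program: (temperature in degrees C, hold time in seconds).
CYCLE = [(95, 45), (52, 60), (72, 120)]
N_CYCLES = 40
FINAL_HOLD = (72, 600)


def total_hold_seconds(cycle, n_cycles, final_hold):
    """Sum of programmed hold times; ramping between steps is ignored."""
    return n_cycles * sum(seconds for _, seconds in cycle) + final_hold[1]


total_s = total_hold_seconds(CYCLE, N_CYCLES, FINAL_HOLD)

print(f"water per master mix: {water_ul} ul")
print(f"programmed hold time: {total_s / 60:.0f} min")
```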

The cDNA fragments obtained from the first round of PCR were used as templates to perform
the second round of PCR using primers 22 and 23 for cloning bi-OMT cDNA and primers H1S and
R2A for cloning 4CL cDNA. The second-round PCR conditions were the same as those of the first round.

The desired cDNA fragment was then sub-cloned and sequenced. After the second round of
PCR, the product with the predicted size was excised from the gel and ligated into a pUC19 vector,
available from Clontech, of Palo Alto, CA, and then transformed into DH5α, an E. coli strain
available from Gibco BRL, of Gaithersburg, MD. After the inserts had been checked for correct size,
the colonies were isolated and plasmids were sequenced using a Sequenase kit available from USB, of
Cleveland, OH. The sequences are shown in Fig. 2 (SEQ ID 3) and Fig. 3 (SEQ ID 4).

Example 2 - Alternative Isolation Method of Angiosperm bi-OMT gene

As previously mentioned, one bi-OMT clone was produced via a modified differential display
technique. This method is another type of reverse transcription-PCR, in which DNA-free total RNA
was reverse transcribed using oligo-dT primers with a single-base anchor to form cDNA. The
oligo-dT primers used for reverse transcription of mRNA to synthesize cDNA were:
T11A: TTTTTTTTTTTTTTA,
T11C: TTTTTTTTTTTTTTC, and
T11G: TTTTTTTTTTTTTTG.

These cDNAs were then used as templates for radioactive PCR, which was conducted in the
presence of the same oligo-dT primers as listed above, a bi-OMT gene-specific primer and 35S-dATP.
The OMT gene-specific primer was derived from the following amino acid sequence: 5'-Cys Cys Asn
Gly Gly Asn Gly Gly Ser Ala Arg Gly Ala-3'.

The following PCR reaction solutions were combined in a microfuge tube:

a. H2O 9.2 μl

b. Taq Buffer 2.0 μl

c. dNTP (25 μM) 1.6 μl

d. Primers (5 μM) 2 μl, for each primer

e. 35S-dATP 1 μl

f. Taq. pol. 0.2 μl

g. cDNA 2.0 μl

The tube was heated to a temperature of 94° C and held for 45 seconds, then at 37° C for 2 % 
minutes and then 72 °C for 45 seconds for forty cycles, followed by a final reaction at 72°C for 5 
minutes. 

The amplified products were fractionated on a denaturing polyacrylamide sequencing gel and
autoradiography was used to identify and excise the fragments with a predicted size. The designed
OMT gene-specific primer had a sequence conserved in a region toward the 3'-end of the OMT cDNA
sequence. This primer, together with oligo-dT, amplified an OMT cDNA fragment of about
300 bp.

Three oligo-dTs with a single-base anchor of A, C or G, respectively, were used to pair with the
OMT gene-specific primer. Eight potential OMT cDNA fragments with predicted sizes of about 300 bp
were excised from the gels after several independent PCR rounds using different combinations of
oligo-dT and OMT gene-specific oligonucleotides as primers.

The OMT cDNA fragments were then re-amplified. A Southern blot analysis was performed
on the resulting cDNAs using a 360 base-pair, 32P radio-isotope labeled, aspen OMT cDNA 3'-end
fragment as a probe to identify the cDNA fragments having a strong hybridization signal under low
stringency conditions. Eight fragments were identified. Out of these eight cDNA fragments, three
were selected based on their high hybridization signal for sub-cloning and sequencing. One clone,
LsOMT3'-1 (where the "Ls" prefix indicates that the clone was derived from the Liquidambar
styraciflua (L.) genome), was confirmed to encode bi-OMT based on its high homology to other
lignin-specific plant OMTs at both nucleotide and amino acid sequence levels.

A cDNA library was constructed in Lambda ZAP II, available from Stratagene, of La Jolla,
CA, using 5 mg poly(A)+ RNA isolated from sweetgum xylem tissue. The primary library consisting of
approximately 0.7 x 10^6 independent recombinants was amplified and approximately 10^5
plaque-forming units (pfu) were screened using a homologous 550 base-pair probe. The hybridized
filter was washed under high stringency conditions (0.25 x SSC, 0.1% SDS, 65° C). The colony
containing the bi-OMT fragment identified by the probe was eluted and the bi-OMT fragment was
produced. The sequence as illustrated in Fig. 2 (SEQ ID 3) was obtained.

Example 3 - Isolating and Producing the DNA which codes for the Angiosperm FA5H-1 Gene

In order to find putative FA5H cDNA fragments as probes for cDNA library screening, a
highly degenerate sense primer based on the amino acid sequence of 5'-Glu, Glu, Phe, Arg, Pro, Glu,
Arg-3' was designed based on the conserved regions found in some plant P450 proteins. This
conserved domain was located upstream of another highly conserved region in P450 proteins, which
had an amino acid sequence of 5'-Phe Gly Xaa Gly Xaa Xaa Cys Xaa Gly-3'. This primer was
synthesized with the incorporation of an XhoI restriction site to give a 26-base-pair oligomer with a
nucleotide sequence of 5'-ATG TGC AGT TTT TTT TTT TTT TTT TT-3'.

This primer and the oligo-dT-XhoI primer were then used to perform PCR reactions with the
sweetgum cDNA library as a template. The cDNA library was constructed in Lambda ZAP II, available
from Stratagene, of La Jolla, CA, using poly(A)+ RNA isolated from sweetgum xylem tissue.
Amplified fragments of 300 to 600 bp were obtained. Because the designed primer was located
upstream of the highly conserved P450 domain, this design distinguished whether the PCR products
were P450 gene fragments depending on whether they contained the highly conserved amino acid
domain.

All the fragments obtained from the PCR reaction were then cloned into a pUC19 vector,
available from Stratagene, of La Jolla, CA, and transformed into a DH5α E. coli strain, available from
Gibco BRL, of Gaithersburg, MD.

Twenty-four positive colonies were obtained and sequenced. Sequence analysis indicated four
groupings within the twenty-four colonies. One was C4H, one was an unknown P450 gene, and two
did not belong to P450 genes. Homologies of P450 genes in different species are usually more than
80%. Because the homologies between the P450 gene families found here were around 40%, the
sequence analysis indicated that a new P450 gene family was sequenced. Moreover, since this P450
cDNA was isolated from xylem tissue, it was highly probable that this P450 gene was FA5H-1.

The novel sweetgum P450 cDNA fragment was used as a probe to screen for a full-length cDNA
encoding FA5H-1. Once the FA5H-1 gene was located it was sequenced. The length of the
FA5H-1 cDNA is 1707 bp and it contains 45 bp of 5' non-coding region and 135 bp of 3' non-coding
region. The deduced amino acid sequence also indicates that this P450 cDNA has a hydrophobic core
at the N-terminus, which could be regarded as a leader sequence for co-translational targeting to
membranes during protein synthesis. At the C-terminal region, there is a heme binding domain that is
characteristic of all P450 genes. The FA5H-1 sequence, as illustrated in Fig. 4 (SEQ ID 1), was
produced according to the above described methods.
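
The lengths quoted above imply a size for the coding region. The following sketch works through that arithmetic; it assumes, without support from the text, that the coding region ends with a single stop codon that is not counted in either non-coding figure.

```python
# Figures quoted in the text for the FA5H-1 cDNA.
CDNA_LENGTH_BP = 1707
FIVE_PRIME_NONCODING_BP = 45
THREE_PRIME_NONCODING_BP = 135

coding_bp = CDNA_LENGTH_BP - FIVE_PRIME_NONCODING_BP - THREE_PRIME_NONCODING_BP
codons, remainder = divmod(coding_bp, 3)

# Assumption: the last codon of the coding region is a stop codon,
# which contributes no amino acid residue.
residues = codons - 1

print(f"coding region: {coding_bp} bp ({codons} codons, remainder {remainder})")
print(f"predicted protein length: {residues} residues")
```

A remainder of zero indicates that the quoted figures are at least consistent with an intact reading frame.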

Example 4 - Isolating and Producing the DNA which codes for the Angiosperm FA5H-2 Gene

By using a similar strategy of synthesizing PCR primers from the published literature for
hydroxylase genes in plants, another full length FA5H cDNA has been isolated that shows significant
similarity with a putative F5H clone from Arabidopsis (Meyer et al. 1996: PNAS 93, 6869-6874).
This cloned cDNA, designated FA5H-2, contains 1883 bp and encodes an open reading frame of 511
amino acids. The amino acid similarity shared between Arabidopsis F5H and the FA5H-2 sweetgum
clone is about 75%, indicating that the isolated clone belongs to the same class of cDNAs that encode an
F5H protein, which has been shown to be functional by genetic complementation in Arabidopsis.

To confirm the function of the FA5H-2 gene, it was expressed in E. coli strain DH5 alpha via
pQE vector preparation, according to directions available with the kit. A CO-Fe2+ binding assay was
also performed to confirm the expression of FA5H-2 as a functional P450 gene (Omura & Sato 1964,
J. of Biological Chemistry 239: 2370-2378; Babriac et al. 1991, Archives of Biochemistry and Biophysics
288:302-309). The CO-Fe2+ binding assay showed a peak at 450 nm, which indicates that FA5H-2 has
been overexpressed as a functional P450 gene.

The FA5H-2 protein was further purified for production of antibodies in rabbits, and antibodies
have been successfully produced. In addition, Western blots show that this antibody is specific to the
membrane fraction of sweetgum and aspen xylem extract. When the FA5H-2 antibody was added to a
reaction mixture containing aspen xylem tissue, enzyme inhibition studies showed that the activity of
FA5H in aspen was reduced more than 60%, a further indication that FA5H-2 performs a P450-like
function. Fig. 5 (SEQ ID 2) illustrates the FA5H-2 sequence.

Example 5 - Identifying Gymnosperm Promoter Regions

In order to identify gymnosperm promoter regions, sequences from loblolly pine PAL and
4CL1B and 4CL3B lignin genes were used as primers to screen the loblolly pine genomic library, using
the GenomeWalker Kit. The loblolly pine PAL primer sequence was obtained from GenBank,
reference number U39792. The loblolly pine 4CL1B primer sequences were also obtained from
GenBank, reference numbers U39404 and U39405.

The loblolly pine genomic library was constructed in Lambda DASH II, available from
Stratagene, of La Jolla, CA. 3 x 10^6 phage plaques from the genomic library of loblolly pine were
screened using both the above mentioned PAL cDNA and 4CL (PCR clone) fragments as probes. Five
4CL clones were obtained after screening. Lambda DNAs of two of the five 4CL clones
were isolated and digested by EcoRV, PstI, SalI and XbaI for Southern analysis.
Southern analysis using 4CL fragments as probes indicated that both clones for the 4CL gene were
identical. Results from further mapping showed that none of the original five 4CL clones contained
promoter regions. When tested, the PAL clones obtained from the screening also did not contain
promoter regions.

In a second attempt to clone the promoter regions associated with the PAL and 4CL genes, a Universal
GenomeWalker™ kit, available from CLONTECH, was used. In the process, total DNA from
loblolly pine was digested by several restriction enzymes and ligated to the adaptors (libraries)
provided with the kit. Two gene-specific primers for each gene were designed (GSP1 and 2). After two
rounds of PCR using these primers and adaptor primers of the kit, several fragments were amplified
from each library. A 1.6 kb fragment and a 0.6 kb fragment for the PAL gene and a 2.3 kb fragment
(4CL1B) and a 0.7 kb fragment (4CL3B) for the 4CL gene were cloned, sequenced and found to
contain promoter regions for all three genes. See Fig. 6 (SEQ ID 6), 7 (SEQ ID 7) and 8 (SEQ ID 5).

Example 6 - Fusing the ASL DNA Sequence to a Constitutive Promoter Region and Inserting the
Expression Cassette Into a Gymnosperm Genome

As a first step, an ASL DNA sequence, FA5H-1, was fused with a constitutive promoter region
according to the methods described in the above Section IV to form an FA5H-1 expression cassette. A
second ASL DNA sequence, FA5H-2, was then fused with a constitutive promoter in the same manner
to form an FA5H-2 expression cassette. The FA5H-1 expression cassette was inserted into the
gymnosperm genome by micro-projectile bombardment. Embryogenic tissue cultures of loblolly pine
were initiated from immature zygotic embryos. The tissue was maintained in an undifferentiated state
on semi-solid proliferation medium, according to methods described by Newton et al. TAES Technical
Publication "Somatic Embryogenesis in Slash Pine", 1995 and Keinonen-Mettala et al. 1996, Scand. J.
For. Res. 11: 242-250.

After separation, 5 ml of the liquid cell suspension fraction that passed through the 40 mesh
screen was vacuum deposited onto filter paper and placed on semi-solid proliferation medium. The
prepared gymnosperm target cells were then grown for 2 days on filter paper discs placed on semi-solid
proliferation medium in a petri dish. These target cells were then bombarded with plasmid DNA
containing the FA5H-1 expression cassette and an expression cassette containing a selectable marker
gene encoding the enzyme which confers resistance to the antibiotic hygromycin B. A 1:1 mixture
of selectable marker expression cassette and plasmid DNA containing the FA5H-1 expression cassette was
precipitated with gold (1.5-3.0 microns) as described by Sanford et al. (1992). The DNA-coated
microprojectiles were rinsed in absolute ethanol and aliquots of 10 μl (5 μg DNA/3 mg gold) were dried
onto a macrocarrier, such as those available from BioRad (Hercules, CA).

Prior to bombardment, embryogenic tissue was desiccated under a sterile laminar-flow hood
for 5 minutes. The desiccated tissue was transferred to semi-solid proliferation medium. The
microprojectiles were accelerated into desiccated target cells using a BioRad PDS-1000/He particle
gun.

Each plate was bombarded once, rotated 180 degrees, and bombarded a second time. Preferred
bombardment parameters were 1350 psi rupture disc pressure, 6 mm distance from the rupture disc to
macrocarrier (gap distance), 1 cm macrocarrier travel distance, and 10 cm distance from macrocarrier
stopping screen to culture plate (microcarrier travel distance). Tissue was then transferred to semi-solid
proliferation medium containing hygromycin B for two days after bombardment.

The FA5H-2 expression cassette was inserted into the gymnosperm genome according to the 
same procedures. 

Example 7 - Selecting Transformed Target Cells

After insertion of the FA5H-1 expression cassette and the selectable marker expression cassette
into the gymnosperm target cells as described in Example 6, transformed cells were selected by
exposure to an antibiotic that causes mortality of any cells not containing the GSL expression cassette.
Forty independent cell lines were established from cultures co-bombarded with an expression cassette
containing a hygromycin resistance gene construct and the FA5H-1 construct. These cell lines include
lines Y2, Y17, Y7 and 04, as discussed in more detail below.

PCR techniques were then used to verify that the FA5H-1 gene had been successfully integrated
into the genomes of the established cell lines. Genomic DNA was extracted using the Plant DNeasy
kit, available from Qiagen, and 200 ng of DNA from each cell line was used for each PCR reaction. Two
FA5H-1 specific primers were designed to perform a PCR reaction with a 600 bp PCR product size.
The primers were:

LsFA5H-iml-S primer: ATGGCTTTCCTTCTAATACCCATCTC, and

LsFA5H-iml-A primer: GGGTGTAATGGACGAGCAAGGACTTG.
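
For illustration, basic properties of the two primers listed above (length, GC content and reverse complement) can be computed as follows. The function names are hypothetical, and nothing here is taken from the applicants' own primer-design procedure.

```python
# Translation table for complementing unambiguous DNA bases.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return seq.translate(COMPLEMENT)[::-1]


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


PRIMERS = {
    "LsFA5H-iml-S": "ATGGCTTTCCTTCTAATACCCATCTC",
    "LsFA5H-iml-A": "GGGTGTAATGGACGAGCAAGGACTTG",
}

for name, seq in PRIMERS.items():
    print(f"{name}: {len(seq)} nt, GC {gc_fraction(seq):.2f}, "
          f"reverse complement {reverse_complement(seq)}")
```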

Each PCR reaction (100 μl) consisted of 75 μl H2O, 1 μl MgCl2 (25 mM), 10 μl PCR buffer, 1
μl 10 mM dNTPs, and 10 μl DNA. 100 μl oil was layered on the top of each reaction mix. Hot start
PCR was done as follows: PCR reaction was incubated at 95 degrees C for 7 minutes and 1 μl each of
both LsFA5H-iml-S and LsFA5H-iml-A primers (100 μM stock) and 1 μl of Taq polymerase were
added through oil in each reaction. The PCR program used was 95 degrees C for 1.5 minutes, 55
degrees C for 45 sec and 72 degrees C for 2 minutes, repeated for 40 cycles, followed by extension at
72 degrees C for 10 minutes.
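
The hot-start reaction described above can be tallied to confirm that the listed volumes add up to the stated 100 microliters and to estimate final concentrations. This sketch reads the magnesium salt as MgCl2 and counts only the magnesium added from the 25 mM stock; any magnesium already in the PCR buffer is unknown and left out. The helper name is hypothetical.

```python
# Volumes in microliters; the last three entries are added after the
# initial 95 degree C incubation (hot start).
REACTION_UL = {
    "H2O": 75.0,
    "MgCl2 (25 mM)": 1.0,
    "PCR buffer": 10.0,
    "dNTPs (10 mM)": 1.0,
    "template DNA": 10.0,
    "sense primer (100 uM)": 1.0,
    "anti-sense primer (100 uM)": 1.0,
    "Taq polymerase": 1.0,
}

total_ul = sum(REACTION_UL.values())


def final_concentration(stock: float, volume_ul: float, total: float) -> float:
    """Dilution rule C1 * V1 = C2 * V2, solved for C2."""
    return stock * volume_ul / total


primer_um = final_concentration(100.0, 1.0, total_ul)  # each primer, uM
dntp_mm = final_concentration(10.0, 1.0, total_ul)     # mM
mgcl2_mm = final_concentration(25.0, 1.0, total_ul)    # mM from stock only

print(f"total volume: {total_ul} ul")
print(f"each primer: {primer_um} uM, dNTPs: {dntp_mm} mM, "
      f"added MgCl2: {mgcl2_mm} mM")
```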

The above PCR products were employed to determine if gymnosperm cells contained the
angiosperm lignin gene sequences. With reference to Fig. 9, PCR amplification was performed using
template DNA from cells which grew vigorously on hygromycin B-containing medium. The PCR
products were electrophoresed in an agarose gel containing 9 lanes. Lanes 1-4 contained PCR
amplification products of the sweetgum FA5H-1 gene from a non-transformed control and transgenic
loblolly pine cell lines. Lane 1 contained the non-transformed control PT52. Lane 2 contained
transgenic line Y2. Lane 3 contained transgenic line Y17 and Lane 4 contained the plasmid which
contains the expression cassette pSSLsFA5H1-im-s. Lanes 2 through 4 all contain an amplified
fragment of about 600 bp, indicating that the FA5H-1 gene has been successfully inserted into
transgenic cell lines Y2 and Y17.

Lane 5 contained a DNA size marker PhiX174/HaeIII (BRL). The top four bands in this lane
indicate molecular sizes of 1353, 1078, 872 and 603 bp.

Lanes 6-9 contained PCR amplification products of hygromycin B gene from non-transformed
control and transgenic loblolly pine cell lines. Lane 6 contained the non-transformed control PT52 line,
available from . Lane 7 contained transgenic line Y7. Lane
8 contained transgenic line 04. Lane 9 contained the plasmid which includes the expression cassette
containing the gene encoding the enzyme which confers resistance to the antibiotic hygromycin B.
Lanes 7-9 all show an amplified fragment of about 1000 bp, indicating that the hygromycin gene has
been successfully inserted into transgenic lines Y7 and 04.

These PCR results confirmed the presence of FA5H-1 and hygromycin resistance gene in
transformed loblolly pine cell cultures. The results obtained from the PCR verification of 4 cell lines,
and similar tests with the remaining 36 cell lines, confirm stable integration of the FA5H-1 gene and
the hygromycin B gene in 25% of the 40 cell lines.

In addition, loblolly pine embryogenic cells which have been co-bombarded with the FA5H-2
and hygromycin B expression cassettes are growing vigorously on hygromycin selection medium,
indicating that the FA5H-2 expression cassette was successfully integrated into the gymnosperm
genome.

Although various embodiments and features of the invention have been described in the
foregoing detailed description, those of ordinary skill will recognize the invention is capable of
numerous modifications, rearrangements and substitutions without departing from the scope of the
invention as set forth in the appended claims. For example, in the case where the lignin DNA sequence
is transcribed and translated to produce a functional syringyl lignin gene, those of ordinary skill will
recognize that because of codon degeneracy a number of polynucleotide sequences will encode the same
gene. These variants are intended to be covered by the DNA sequences disclosed and claimed herein.
In addition, the sequences claimed herein include those sequences which encode a gene having substantial
functional identity with those claimed. Thus, in the case of syringyl lignin genes, for example, the
DNA sequences include variant polynucleotide sequences encoding polypeptides which have substantial
identity with the amino acid sequence of syringyl lignin and which show syringyl lignin activity in
gymnosperms.
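
The codon degeneracy mentioned above can be made concrete with a short sketch. It uses the standard genetic code and, purely as an example, the first three residues of the FA5H-1 listing (Met Ala Phe, encoded there as ATG GCT TTC); the function names are hypothetical.

```python
from itertools import product
from math import prod

# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Group codons by the amino acid (one-letter code) they encode.
SYNONYMS = {}
for codon, aa in CODON_TABLE.items():
    SYNONYMS.setdefault(aa, []).append(codon)


def count_encodings(peptide: str) -> int:
    """Number of distinct DNA sequences that encode the peptide."""
    return prod(len(SYNONYMS[aa]) for aa in peptide)


def all_encodings(peptide: str) -> list:
    """Enumerate every DNA sequence encoding the peptide (short peptides only)."""
    return ["".join(codons) for codons in product(*(SYNONYMS[aa] for aa in peptide))]


def translate(dna: str) -> str:
    """Translate an in-frame DNA string into one-letter amino acid codes."""
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


peptide = "MAF"  # Met Ala Phe
variants = all_encodings(peptide)
print(count_encodings(peptide), "sequences encode", peptide)
print(variants)
```

Even for three residues there are several equivalent coding sequences, and the count grows multiplicatively with peptide length.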
CGGCACGAGG AAACCCTAAA ACTCACCTCT CTTACCCTTT CTCTTCA ATG GCT TTC 56
                                                    Met Ala Phe

CTT CTA ATA CCC ATC TCA ATA ATC TTC ATC GTC TTA GCT TAC CAG 101
Leu Leu Ile Pro Ile Ser Ile Ile Phe Ile Val Leu Ala Tyr Gln

CTC TAT CAA CGG CTC AGA TTT AAG CTC CCA CCC GGC CCA CGT CCA 146
Leu Tyr Gln Arg Leu Arg Phe Lys Leu Pro Pro Gly Pro Arg Pro

TGG CCG ATC GTC GGA AAC CTT TAC GAC ATA AAA CCG GTG AGG TTC 191
Trp Pro Ile Val Gly Asn Leu Tyr Asp Ile Lys Pro Val Arg Phe

CGG TGT TTC GCC GAG TGG TCA CAA GCG TAC GGT CCG ATC ATA TCG 236
Arg Cys Phe Ala Glu Trp Ser Gln Ala Tyr Gly Pro Ile Ile Ser

GTG TGG TTC GGT TCA ACG TTG AAT GTG ATC GTA TCG AAT TCG GAA 281
Val Trp Phe Gly Ser Thr Leu Asn Val Ile Val Ser Asn Ser Glu

TTG GCT AAG GAA GTG CTC AAG GAA AAA GAT CAA CAA TTG GCT GAT 326
Leu Ala Lys Glu Val Leu Lys Glu Lys Asp Gln Gln Leu Ala Asp

AGG CAT AGG AGT AGA TCA GCT GCC AAA TTT AGC AGG GAT GGG CAG 371
Arg His Arg Ser Arg Ser Ala Ala Lys Phe Ser Arg Asp Gly Gln

GAC CTT ATA TGG GCT GAT TAT GGA CCT CAC TAT GTG AAG GTT ACA 416
Asp Leu Ile Trp Ala Asp Tyr Gly Pro His Tyr Val Lys Val Thr

AAG GTT TGT ACC CTC GAG CTT TTT ACT CCA AAG CGG CTT GAA GCT 461
Lys Val Cys Thr Leu Glu Leu Phe Thr Pro Lys Arg Leu Glu Ala

CTT AGA CCC ATT AGA GAA GAT GAA GTT ACA GCC ATG GTT GAG TCC 506
Leu Arg Pro Ile Arg Glu Asp Glu Val Thr Ala Met Val Glu Ser

ATT TTT AAT GAC ACT GCG AAT CCT GAA AAT TAT GGG AAG AGT ATG 551
Ile Phe Asn Asp Thr Ala Asn Pro Glu Asn Tyr Gly Lys Ser Met

CTG GTG AAG AAG TAT TTG GGA GCA GTA GCA TTC AAC AAC ATT ACA 596
Leu Val Lys Lys Tyr Leu Gly Ala Val Ala Phe Asn Asn Ile Thr

AGA CTC GCA TTT GGA AAG CGA TTC GTG AAT TCA GAG GGT GTA ATG 641
Arg Leu Ala Phe Gly Lys Arg Phe Val Asn Ser Glu Gly Val Met

GAC GAG CAA GGA CTT GAA TTT AAG GAA ATT GTG GCC AAT GGA CTC 686
Asp Glu Gln Gly Leu Glu Phe Lys Glu Ile Val Ala Asn Gly Leu

AAG CTT GGT GCC TCA CTT GCA ATG GCT GAG CAC ATT CCT TGG CTC 731
Lys Leu Gly Ala Ser Leu Ala Met Ala Glu His Ile Pro Trp Leu

CGT TGG ATG TTC CCA CTT GAG GAA GGG GCC TTT GCC AAG CAT GGG 776
Arg Trp Met Phe Pro Leu Glu Glu Gly Ala Phe Ala Lys His Gly

GCA CGT AGG GAC CGA CTT ACC AGA GCT ATC ATG GAA GAG CAC ACA 821
Ala Arg Arg Asp Arg Leu Thr Arg Ala Ile Met Glu Glu His Thr



ATA GCC CGT AAA AAG AGT GGT GGA GCC CAA CAA CAT TTC GTG GAT 866
Ile Ala Arg Lys Lys Ser Gly Gly Ala Gln Gln His Phe Val Asp

GCA TTG CTC ACC CTA CAA GAG AAA TAT GAC CTT AGC GAG GAC ACT 911
Ala Leu Leu Thr Leu Gln Glu Lys Tyr Asp Leu Ser Glu Asp Thr

ATT ATT GGG CTC CTT TGG GAT ATG ATC ACT GCA GGC ATG GAC ACA 956
Ile Ile Gly Leu Leu Trp Asp Met Ile Thr Ala Gly Met Asp Thr

ACC GCA ATC TCT GTC GAA TGG GCC ATG GCC GAG TTA ATT AAG AAC 1001
Thr Ala Ile Ser Val Glu Trp Ala Met Ala Glu Leu Ile Lys Asn

CCA AGG GTG CAA CAA AAA GCT CAA GAG GAG CTA GAC AAT GTA CTT 1046
Pro Arg Val Gln Gln Lys Ala Gln Glu Glu Leu Asp Asn Val Leu

GGG TCC GAA CGT GTC CTG ACC GAA TTG GAC TTC TCA AGC CTC CCT 1091
Gly Ser Glu Arg Val Leu Thr Glu Leu Asp Phe Ser Ser Leu Pro

TAT CTA CAA TGT GTA GCC AAG GAG GCA CTA AGG CTG CAC CCT CCA 1136
Tyr Leu Gln Cys Val Ala Lys Glu Ala Leu Arg Leu His Pro Pro

ACA CCA CTA ATG CTC CCT CAT CGC GCC AAT GCC AAC GTC AAA ATT 1181
Thr Pro Leu Met Leu Pro His Arg Ala Asn Ala Asn Val Lys Ile

GGT GGC TAC GAC ATC CCT AAG GGA TCA AAT GTT CAT GTA AAT GTC 1226
Gly Gly Tyr Asp Ile Pro Lys Gly Ser Asn Val His Val Asn Val

TGG GCC GTG GCT CGT GAT CCA GCA GTG TGG CGT GAC CCA CTA GAG 1271
Trp Ala Val Ala Arg Asp Pro Ala Val Trp Arg Asp Pro Leu Glu

TTT CGA CCG GAA CGG TTC TCT GAA GAC GAT GTC GAC ATG AAA GGT 1316
Phe Arg Pro Glu Arg Phe Leu Glu Asp Asp Val Asp Met Lys Gly

CAC GAT TAT AGG CTA CTG CCG TTT GGT GCA GGG AGG CGT GTT TGC 1361
His Asp Tyr Arg Leu Leu Pro Phe Gly Ala Gly Arg Arg Val Cys

CCC GGT GCA CAA CTT GGC ATC AAT TTG GTC ACA TCC ATG ATG GGT 1406
Pro Gly Ala Gln Leu Gly Ile Asn Leu Val Thr Ser Met Met Gly

CAC CTA TTG CAC CAT TTC TAT TGG AGC CCT CCT AAA GGT GTA AAA 1451
His Leu Leu His His Phe Tyr Trp Ser Pro Pro Lys Gly Val Lys

CCA GAG GAG ATT GAC ATG TCA GAG AAT CCA GGA TTG GTC ACC TAC 1496
Pro Glu Glu Ile Asp Met Ser Glu Asn Pro Gly Leu Val Thr Tyr

ATG CGA ACC CCG GTG CAA GCT GTT CCC ACT CCA AGG CTG CCT GCT 1541
Met Arg Thr Pro Val Gln Ala Val Pro Thr Pro Arg Leu Pro Ala

CAC TTG TAC AAA CGT GTA GCT GTG GAT ATG TAATTCTTAG TTTGTTATTA 1591
His Leu Tyr Lys Arg Val Ala Val Asp Met

TTCATGCTCT TAAGGTTTTG GACTTTGAAC TTATGATGAG ATTTGTAAAA TTCCAAGTGA 1651
TCAAATGAAG AAAAGACCAA ATAAAAAGGC TTGACGATTT AAAAAAAAAA AAAAAAA 1708

TGCAAACCTG CACAAACAAA GAGAGAGAAG AAGAAAAAGG 40
AAGAGAGGAG AGAGAGAGAG AGAGAGAGAA GC 72

CAT GGA TTC TTC TCT TCA TGA AGC CTT GCA ACC ACT ACC CAT GAC GCT 120
Met Asp Ser Ser Leu His Glu Ala Leu Gln Pro Leu Pro Met Thr Leu

GTT CTT CAT TAT ACC TTT GCT ACT CTT ATT GGG CCT AGT ATC TCG GCT 168
Phe Phe Ile Ile Pro Leu Leu Leu Leu Leu Gly Leu Val Ser Arg Leu

TCG CCA GAG ACT ACC ATA CCC ACC AGG CCC AAA AGG CTT ACC GGT GAT 216
Arg Gln Arg Leu Pro Tyr Pro Pro Gly Pro Lys Gly Leu Pro Val Ile

CGG AAA CAT GCT CAT GAT GGA TCA ACT CAC TCA CCG AGG ACT CGC CAA 264
Gly Asn Met Leu Met Met Asp Gln Leu Thr His Arg Gly Leu Ala Lys

ACT CGC CAA ACA ATA CGG CGG TCT ATT CCA CCT CAA GAT GGG ATT CTT 312
Leu Ala Lys Gln Tyr Gly Gly Leu Phe His Leu Lys Met Gly Phe Leu

ACA CAT GGT GGC CGT TTC CAC ACC CGA CAT GGC TCG CCA AGT CCT TCA 360
His Met Val Ala Val Ser Thr Pro Asp Met Ala Arg Gln Val Leu Gln

AGT CCA AGA CAA CAT CTT CTC GAA CCG GCC AGC CAC CAT AGC CAT CAG 408
Val Gln Asp Asn Ile Phe Ser Asn Arg Pro Ala Thr Ile Ala Ile Ser

CTA CCT CAC CTA TGA CCG AGC CGA CAT GGC CTT CGC TCA CTA CGG CCC 456
Tyr Leu Thr Tyr Asp Arg Ala Asp Met Ala Phe Ala His Tyr Gly Pro

GTT TTG GCG TCA GAT GCG TAA ACT CTG CGT CAT GAA ATT ATT TAG CCG 504
Phe Trp Arg Gln Met Arg Lys Leu Cys Val Met Lys Leu Phe Ser Arg

GAA ACG AGC CGA GTC GTG GGA GTC GGT CCG AGA CGA GGT CGA CTC GGC 552
Lys Arg Ala Glu Ser Trp Glu Ser Val Arg Asp Glu Val Asp Ser Ala

AGT ACG AGT GGT CGC GTC CAA TAT TGG GTC GAC GGT GAA TAT CGG CGA 600
Val Arg Val Val Ala Ser Asn Ile Gly Ser Thr Val Asn Ile Gly Glu

GCT GGT TTT TGC TCT GAC GAA GAA TAT TAC TTA CAG GGC GGC TTT TGG 648
Leu Val Phe Ala Leu Thr Lys Asn Ile Thr Tyr Arg Ala Ala Phe Gly

GAC GAT CTC GCA TGA GGA CCA GGA CGA GTT CGT GGC CAT ACT GCA AGA 696
Thr Ile Ser His Glu Asp Gln Asp Glu Phe Val Ala Ile Leu Gln Glu

GTT TTC GCA GCT GTT TGG TGC TTT TAA TAT AGC TGA TTT TAT CCC TTG 744
Phe Ser Gln Leu Phe Gly Ala Phe Asn Ile Ala Asp Phe Ile Pro Trp

GCT CAA ATG GGT TCC TCA GGG GAT TAA CGT CAG GCT CAA CAA GGC ACG 792
Leu Lys Trp Val Pro Gln Gly Ile Asn Val Arg Leu Asn Lys Ala Arg

AGG GGC GCT TGA TGG GTT TAT TGA CAA GAT CAT CGA CGA TCA TAT ACA 840
Gly Ala Leu Asp Gly Phe Ile Asp Lys Ile Ile Asp Asp His Ile Gln

GAA GGG GAG TAA AAA CTC GGA GGA GGT TGA TAC TGA TAT GGT AGA TGA 888
Lys Gly Ser Lys Asn Ser Glu Glu Val Asp Thr Asp Met Val Asp Asp

TTT ACT TGC TTT TTA CGG TGA GGA AGC CAA AGT AAG CGA ATC TGA CGA 936
Leu Leu Ala Phe Tyr Gly Glu Glu Ala Lys Val Ser Glu Ser Asp Asp

TCT TCA AAA TTC CAT CAA ACT CAC CAA AGA CAA CAT CAA AGC TAT CAT 984
Leu Gln Asn Ser Ile Lys Leu Thr Lys Asp Asn Ile Lys Ala Ile Met

GGA CGT AAT GTT TGG AGG GAC CGA AAC GGT GGC GTC CGC GAT TGA ATG 1032
Asp Val Met Phe Gly Gly Thr Glu Thr Val Ala Ser Ala Ile Glu Trp

GGC CAT GAC GGA GCT GAT GAA AAG CCC AGA AGA TCT AAA GAA GGT CCA 1080
Ala Met Thr Glu Leu Met Lys Ser Pro Glu Asp Leu Lys Lys Val Gln

ACA AGA ACT CGC CGT GGT GGT GGG TCT TGA CCG GCG AGT CGA AGA GAA 1128
Gln Glu Leu Ala Val Val Val Gly Leu Asp Arg Arg Val Glu Glu Lys

AGA CTT CGA GAA GCT CAC CTA CTT GAA ATG CGT ACT GAA GGA AGT CCT 1176
Asp Phe Glu Lys Leu Thr Tyr Leu Lys Cys Val Leu Lys Glu Val Leu

TCG CCT CCA CCC ACC CAT CCC ACT CCT CCT CCA CGA GAC TGC CGA GGA 1224
Arg Leu His Pro Pro Ile Pro Leu Leu Leu His Glu Thr Ala Glu Asp

CGC CGA GGT CGG CGG CTA CTA CAT TCC GGC GAA ATC GCG GGT GAT GAT 1272
Ala Glu Val Gly Gly Tyr Tyr Ile Pro Ala Lys Ser Arg Val Met Ile

CAA CGC GTG CGC CAT CGG CCG GGA CAA GAA CTC GTG GGC CGA CCC AGA 1320
Asn Ala Cys Ala Ile Gly Arg Asp Lys Asn Ser Trp Ala Asp Pro Asp

TAC GTT TAG GCC CTC CAG GTT TCT CAA AGA CGG TGT GCC CGA TTT CAA 1368
Thr Phe Arg Pro Ser Arg Phe Leu Lys Asp Gly Val Pro Asp Phe Lys

AGG GAA CAA CTT CGA GTT CAT CCC ATT CGG GTC AGG TCG TCG GTC TTG 1416
Gly Asn Asn Phe Glu Phe Ile Pro Phe Gly Ser Gly Arg Arg Ser Cys

CCC CGG TAT GCA ACT CGG ACT CTA CGC GCT AGA GAC GAC TGT GGC TCA 1464
Pro Gly Met Gln Leu Gly Leu Tyr Ala Leu Glu Thr Thr Val Ala His

CCT CCT TCA CTG TTT CAC GTG GGA GTT GCC GGA CGG GAT GAA ACC GAG 1512
Leu Leu His Cys Phe Thr Trp Glu Leu Pro Asp Gly Met Lys Pro Ser

TGA ACT CGA GAT GAA TGA TGT GTT TGG ACT CAC CGC GCC AAG AGC GAT 1560
Glu Leu Glu Met Asn Asp Val Phe Gly Leu Thr Ala Pro Arg Ala Ile

TCG ACT CAC CGC CGT GCC GAG TCC ACG CCT TCT CTG TCC TCT CTA 1605
Arg Leu Thr Ala Val Pro Ser Pro Arg Leu Leu Cys Pro Leu Tyr

TTGATCGAAT GATTGGGGGA GCTTTGTGGA GGGGCTTTTA TGGAGACTCT ATATATAGAT 1665

GGGAAGTGAA ACAACGACAG GTGAATGCTT GGATTTTTGG TATATATTGG GGAGGGAGGG 1725

GAAAAAAAAA ATAATGAAAG GAAAGAAAAG AGAGAATTTG AATTTCTCTT CCTCTGTGGA 1785

TAAAAGCCTC GTTTTTAATT GTTTTTATGT GGAGATATTT GTGTTTGTTT ATTTTTATCT 1845

CTTTTTTTGC AATAACACTC AAAAATAAAA AAAAAAAA 1883

Information for Sequence ID #: 3 (bi-OMT)
Correspondence Address: Luedeka, Neely & Graham
                        P.O. Box 1871
                        Knoxville, TN 37901
    Addressee: Mark S. Graham
    Street: P.O. Box 1871
    City: Knoxville
    Country: USA
    Zip: 37901
Computer Readable Form:
    Medium Type: 1.44
    Operating System: DOS
    Software: ASCII
Current Application Data: N/A
Attorney Information:
    Name: Mark S. Graham
    Registration Number: 32,355
    Reference/Docket Number: 50617.00
Telecommunication Information:
    Telephone: (423) 546-4305
    Telefax: (423) 523-4478
Information for Seq ID No:
Sequence Characteristics:
    Length: 1380
    Type: DNA
    Strandedness: double
    Topology: linear
Molecule Type: cDNA
Hypothetical: No
Anti-Sense: No
Fragment Type: N/A
Original Source:
    Organism: Liquidambar styraciflua (L.)
    Strain: Wild Type
    Individual Isolate: N/A
    Developmental State: sporophyte
    Haplotype: N/A
    Tissue Type: xylem
    Cell Type: parenchyma
    Cell Line: N/A
    Organelle: N/A



CGGCACGAGC CCTACCTCCT TTCTTGGAAA AATTTCCCCA TTCGATCACA ATCCGGGCCT 60 

CAAAAA ATG GGA TCA ACA AGC GAA ACG AAG ATG AGC CCG AGT GAA GCA 108 
Met Gly Ser Thr Ser Glu Thr Lys Met Ser Pro Ser Glu Ala 

GCA GCA GCA GAA GAA GAA GCA TTC GTA TTC GCT ATG CAA TTA ACC AGT 156 
Ala Ala Ala Glu Glu Glu Ala Phe Val Phe Ala Met Gin Leu Thr Ser 

GCT TCA GTT CTT CCC ATG GTC CTA AAA TCA GCC ATA GAG CTC GAC GTC 204 
Ala Ser Val Leu Pro Met Val Leu Lys Ser Ala lie Glu Leu Asp Val 

TTA GAA ATC ATG GCT AAA GCT GGT CCA GGT GCG CAC ATA TCC ACA TCT 2 52 

Leu Glu lie Met Ala Lys Ala Gly Pro Gly Ala His lie Ser Thr Ser 

GAC ATA GCC TCT AAG CTG CCC ACA AAG AAT CCA GAT GCA GCC GTC ATG 300 
Asp He Ala Ser Lys Leu Pro Thr Lys Asn Pro Asp Ala Ala Val Met 

CTT GAC CGT ATG CTC CGC CTC TTG GCT AGC TAC TCT GTT CTA ACG TGC 348 
Leu Asp Arg Met Leu Arg Leu Leu Ala Ser Tyr Ser Val Leu Thr Cys 

TCT CTC CGC ACC CTC CCT GAC GGC AAG ATC GAG AGG CTT TAC GGC CTT 396 
Ser Leu Arg Thr Leu Pro Asp Gly Lys lie Glu Arg Leu Tyr Gly Leu 

GCA CCC GTT TGT AAA TTC TTG ACC AGA AAC GAT GAT GGA GTC TCC ATA 444 
Ala Pro Val Cys Lys Phe Leu Thr Arg Asn Asp Asp Gly Val Ser Ile

GCC GCT CTG TCT CTC ATG AAT CAA GAC AAG GTC CTC ATG GAG AGC TGG 492
Ala Ala Leu Ser Leu Met Asn Gln Asp Lys Val Leu Met Glu Ser Trp

TAC CAC TTG ACC GAG GCA GTT CTT GAA GGT GGA ATT CCA TTT AAC AAG 540 
Tyr His Leu Thr Glu Ala Val Leu Glu Gly Gly Ile Pro Phe Asn Lys

GCC TAT GGA ATG ACA GCA TTT GAG TAC CAT GGC ACC GAT CCC AGA TTC 588
Ala Tyr Gly Met Thr Ala Phe Glu Tyr His Gly Thr Asp Pro Arg Phe

AAC ACA GTT TTC AAC AAT GGA ATG TCC AAT CAT TCG ACC ATT ACC ATG 636
Asn Thr Val Phe Asn Asn Gly Met Ser Asn His Ser Thr Ile Thr Met

AAG AAA ATC CTT GAG ACT TAC AAA GGG TTC GAG GGA CTT GGA TCT GTG 684 
Lys Lys Ile Leu Glu Thr Tyr Lys Gly Phe Glu Gly Leu Gly Ser Val

GTT GAT GTT GGT GGT GGC ACT GGT GCC CAC CTT AAC ATG ATT ATC GCT 732 
Val Asp Val Gly Gly Gly Thr Gly Ala His Leu Asn Met Ile Ile Ala

AAA TAC CCC ATG ATC AAG GGC ATT AAC TTC GAC TTG CCT CAT GTT ATT 780 
Lys Tyr Pro Met Ile Lys Gly Ile Asn Phe Asp Leu Pro His Val Ile

GAG GAG GCT CCC TCC TAT CCT GGT GTG GAG CAT GTT GGT GGA GAT ATG 828 
Glu Glu Ala Pro Ser Tyr Pro Gly Val Glu His Val Gly Gly Asp Met 

TTT GTT AGT GTT CCA AAA GGA GAT GCC ATT TTC ATG AAG TGG ATA TGT 876
Phe Val Ser Val Pro Lys Gly Asp Ala Ile Phe Met Lys Trp Ile Cys

CAT GAT TGG AGC GAT GAA CAC TGC TTG AAG TTT TTG AAG AAA TGT TAT 924
His Asp Trp Ser Asp Glu His Cys Leu Lys Phe Leu Lys Lys Cys Tyr 

GAA GCA CTT CCA ACC AAT GGG AAG GTG ATC CTT GCT GAA TGC ATC CTC 972 
Glu Ala Leu Pro Thr Asn Gly Lys Val Ile Leu Ala Glu Cys Ile Leu

CCC GTG GCG CCA GAC GCA AGC CTC CCC ACT AAG GCA GTG GTC CAT ATT 1020
Pro Val Ala Pro Asp Ala Ser Leu Pro Thr Lys Ala Val Val His Ile

GAT GTC ATC ATG TTG GCT CAT AAC CCA GGT GGG AAA GAG AGA ACT GAG 1068 
Asp Val Ile Met Leu Ala His Asn Pro Gly Gly Lys Glu Arg Thr Glu

AAG GAG TTT GAG GCC TTG GCC AAG GGG GCT GGA TTT GAA GGT TTC CGA 1116 
Lys Glu Phe Glu Ala Leu Ala Lys Gly Ala Gly Phe Glu Gly Phe Arg 

GTA GTA GCC TCG TGC GCT TAC AAT ACA TGG ATC ATC GAA TTT TTG AAG 1164 
Val Val Ala Ser Cys Ala Tyr Asn Thr Trp Ile Ile Glu Phe Leu Lys

AAG ATT TGAGTCCTTA CTCGGCTTTG AGTACATAAT ACCAACTCCT TTTGGTTTTC 1220
Lys Ile

GAGATTGTGA TTGTGATTGT GATTGTCTCT CTTTCGCAGT TGGCCTTATG ATATAATGTA 1280

TCGTTAACTC GATCACAGAA GTGCAAAAGA CAGTGAATGT ACACTGCTTT ATAAAATAAA 1340 

AATTTTAAGA TTTTGATTCA TGTAAAAAAA AAAAAAAAAA 1380

Information for Sequence ID #:
Correspondence Address : 



Addressee : 
Street : 
City: 
Country : 
Zip : 

Computer Readable Form: 
Medium Type : 
Operating System: 
Software : 

Current Application Data: 
Attorney Information: 
Name : 

Registration Number: 
Reference/Docket Number : 
Telecommunication Information: 
Telephone : 
Telefax : 
Information for Seq ID No: 
Sequence Characteristics: 
Length : 
Type: 

Strandedness : 
Topology: 
Molecule Type : 
Hypothetical : 
Anti-Sense : 
Fragment Type: 
Original Source : 
Organism: 
Strain: 

Individual Isolate : 
Developmental State : 
Haplotype : 
Tissue Type : 
Cell Type: 
Cell Line: 
Organelle : 



4 (4CL) 

Luedeka, Neely & Graham 

P.O. Box 1871 

Knoxville, TN 37901 

Mark S. Graham

P.O. Box 1871 

Knoxville 

USA 

37901 

1.44 
DOS 
ASCII 
N/A 

Mark S. Graham
32,355
50617.00 

(423) 546-4305 
(423) 523-4478

DNA

double 

linear 

cDNA 

No 

No 

N/A 

Liquidambar styraciflua 

Wild Type 

N/A 

sporophyte 

N/A 

xylem 

parenchyma 

N/A 

N/A

CGGCACGAGC TCATTTTCCA CTTCTGGTTT GATCTCTGCA ATTCTTCCAT CAGTCCCTA



ATG GAG ACC CAA ACA AAA CAA GAA GAA ATC ATA TAT CGG TCG AAA
Met Glu Thr Gln Thr Lys Gln Glu Glu Ile Ile Tyr Arg Ser Lys

CTC CCC GAT ATC TAC ATC CCC AAA CAC CTC CCT TTA CAT TCG TAT
Leu Pro Asp Ile Tyr Ile Pro Lys His Leu Pro Leu His Ser Tyr

TGT TTC GAG AAC ATC TCA CAG TTC GGC TCC CGC CCC TGT CTG ATC
Cys Phe Glu Asn Ile Ser Gln Phe Gly Ser Arg Pro Cys Leu Ile

AAT GGC GCA ACG GGC AAG TAT TAC ACA TAT GCT GAG GTT GAG CTC
Asn Gly Ala Thr Gly Lys Tyr Tyr Thr Tyr Ala Glu Val Glu Leu

ATT GCG CGC AAG GTC GCA TCC GGC CTC AAC AAA CTC GGC GTT CGA
Ile Ala Arg Lys Val Ala Ser Gly Leu Asn Lys Leu Gly Val Arg

CAA GGT GAC ATC ATC ATG CTT TTG CTA CCC AAC TCG CCG GAG TTC
Gln Gly Asp Ile Ile Met Leu Leu Leu Pro Asn Ser Pro Glu Phe

GTG TTT TCA ATT CTC GGC GCA TCC TAC CGC GGG GCT GCC GCC ACC
Val Phe Ser Ile Leu Gly Ala Ser Tyr Arg Gly Ala Ala Ala Thr

GCC GCA AAC CCG TTT TAT ACC CCT GCC GAG ATC AGG AAG CAA GCC
Ala Ala Asn Pro Phe Tyr Thr Pro Ala Glu Ile Arg Lys Gln Ala

AAA ACC TCC AAC GCC AGG CTT ATT ATC ACA CAT GCC TGT TAC TAT
Lys Thr Ser Asn Ala Arg Leu Ile Ile Thr His Ala Cys Tyr Tyr

GAG AAA GTG AAG GAC TTG GTG GAA GAG AAC GTT GCC AAG ATC ATA
Glu Lys Val Lys Asp Leu Val Glu Glu Asn Val Ala Lys Ile Ile

TGT ATA GAC TCA CCC CCG GAC GGT TGT TTG CAC TTC TCG GAG CTG
Cys Ile Asp Ser Pro Pro Asp Gly Cys Leu His Phe Ser Glu Leu

AGT GAG GCG GAC GAG AAC GAC ATG CCC AAT GTA GAG ATT GAC CCC
Ser Glu Ala Asp Glu Asn Asp Met Pro Asn Val Glu Ile Asp Pro

GAT GAT GTG GTG GCG CTG CCG TAC TCG TCA GGG ACG ACG GGT TTA
Asp Asp Val Val Ala Leu Pro Tyr Ser Ser Gly Thr Thr Gly Leu

CCA AAG GGG GTG ATG CTA ACA CAC AAG GGA CAA GTG ACG AGT GTG
Pro Lys Gly Val Met Leu Thr His Lys Gly Gln Val Thr Ser Val

GCG CAA CAG GTG GAC GGA GAG AAT CCG AAC CTG TAT ATA CAT AGC
Ala Gln Gln Val Asp Gly Glu Asn Pro Asn Leu Tyr Ile His Ser

GAG GAC GTG GTT CTG TGC GTG TTG CCT CTG TTT CAC ATC TAC TCG
Glu Asp Val Val Leu Cys Val Leu Pro Leu Phe His Ile Tyr Ser

ATG AAC GTC ATG TTT TGC GGG TTA CGA GTT GGT GCG GCG ATT CTG
Met Asn Val Met Phe Cys Gly Leu Arg Val Gly Ala Ala Ile Leu

ATT ATG CAG AAA TTT GAA ATA TAT GGG TTG TTA GAG CTG GTC AGA
Ile Met Gln Lys Phe Glu Ile Tyr Gly Leu Leu Glu Leu Val Arg

AGT ACA GGT GAC CAT CAT GCC TAT CGT ACA CCC ATC GTA TTG GCA
Ser Thr Gly Asp His His Ala Tyr Arg Thr Pro Ile Val Leu Ala

ATC TCC AAG ACT CCG GAT CTT CAC AAC TAT GAT GTG TCC TC ATT 960
Ile Ser Lys Thr Pro Asp Leu His Asn Tyr Asp Val Ser Ser Ile

CGG ACT GTC ATG TCA GGT GCG GCT CCT CTG GGC AAG GAA CT GAA 1005
Arg Thr Val Met Ser Gly Ala Ala Pro Leu Gly Lys Glu Leu Glu

GAT TCT GTC AGA GCT AAG TTT CCC ACC GCC AAA CTT GGT CA GGA 1050
Asp Ser Val Arg Ala Lys Phe Pro Thr Ala Lys Leu Gly Gln Gly

TAT GGA ATG ACG GAG GCA GGG CCC GTG CTA GCG ATG TGT TTG CA 1095
Tyr Gly Met Thr Glu Ala Gly Pro Val Leu Ala Met Cys Leu Ala

TTT GCC AAG GAA GGG TTT GAA ATA AAA TCG GGG GCA TCT GGA AT 1140
Phe Ala Lys Glu Gly Phe Glu Ile Lys Ser Gly Ala Ser Gly Thr

GTT TTA AGG AAC GCA CAG ATG AAG ATT GTG GAC CCT GAA ACC GG 1185
Val Leu Arg Asn Ala Gln Met Lys Ile Val Asp Pro Glu Thr Gly

GTC ACT CTC CCT CGA AAC CAA CCC GGA GAG ATT TGC ATT AGA GGA 1230
Val Thr Leu Pro Arg Asn Gln Pro Gly Glu Ile Cys Ile Arg Gly

GAC CAA ATC ATG AAA GGT TAT CTT AAT GAT CCT GAG GCG ACG GAG 1275
Asp Gln Ile Met Lys Gly Tyr Leu Asn Asp Pro Glu Ala Thr Glu

AGA ACC ATA GAC AAG GAA GGT TGG TTA CAC ACA GGT GAT GTG GGC 1320
Arg Thr Ile Asp Lys Glu Gly Trp Leu His Thr Gly Asp Val Gly

TAC ATC GAC GAT GAC ACT GAG CTC TTC ATT GTT GAT CGG TTG AAG 1365
Tyr Ile Asp Asp Asp Thr Glu Leu Phe Ile Val Asp Arg Leu Lys

GAA CTG ATC AAA TAC AAA GGG TTT CAG GTG GCA CCC GCT GAG CTT 1410
Glu Leu Ile Lys Tyr Lys Gly Phe Gln Val Ala Pro Ala Glu Leu

GAG GCC ATG CTC CTC AAC CAT CCC AAC ATC TCT GAT GCT GCC GTC 1455
Glu Ala Met Leu Leu Asn His Pro Asn Ile Ser Asp Ala Ala Val

GTC CCA ATG AAA GAC GAT GAA GCT GGA GAG CTC CCT GTG GCG TTT 1500
Val Pro Met Lys Asp Asp Glu Ala Gly Glu Leu Pro Val Ala Phe

GTT GTA AGA TCA GAT GGT TCT CAG ATA TCC GAG GCT GAA ATC AGG 1545
Val Val Arg Ser Asp Gly Ser Gln Ile Ser Glu Ala Glu Ile Arg

CAA TAC ATC GCA AAA CAG GTG GTT TTT TAT AAA AGA ATA CAT CGC 1590
Gln Tyr Ile Ala Lys Gln Val Val Phe Tyr Lys Arg Ile His Arg

GTA TTT TTC GTC GAA GCC ATT CCT AAA GCG CCC TCT GGC AAA ATC 1635
Val Phe Phe Val Glu Ala Ile Pro Lys Ala Pro Ser Gly Lys Ile

TTG CGG AAG GAC CTG AGA GCC AAA TTG GCG TCT GGT CTT CCC AAT 1680
Leu Arg Lys Asp Leu Arg Ala Lys Leu Ala Ser Gly Leu Pro Asn

TAATTCTCAT TCGCTACCCT CCTTTCTCTT ATCATACGCC AACACGAACG AAGAGGCTCA 1740

ATTAAACGCT GCTCATTCGA AGCGGCTCAA TTAAAGCTGC TCATTCATGT CCACCGAGTG 1800

GGCAGCCTGT CTTGTTGGGA TGTTCTTTCA TTTGATTCAG CTGTGAGAAG CCAGACCCTC

ATTATTTATT GTGAAATTCA CAAGAATGTC TGTAAATCGA TGTTGTGAGT GATGGGTTTC

AAAACACTTT TGACATTGTT TACGTTGTAT TTCCTGCTGT TGAAAATAAC TACTTTGTAT

GACTTTTATT TGGGAAGATA ACCTTTCAAA AAAAAAAAAA AAAAAA

Information for Sequence ID #:
Correspondence Address : 



Addressee : 
Street : 
City: 
Country: 
Zip : 

Computer Readable Form: 
Medium Type : 
Operating System: 
Software : 

Current Application Data: 
Attorney Information: 
Name : 

Registration Number: 
Reference/Docket Number: 
Telecommunication Information: 
Telephone : 
Telefax: 
Information for Seq ID No: 
Sequence Characteristics : 
Length : 
Type : 

Strandedness : 
Topology: 
Molecule Type : 
Hypothetical : 
Anti-Sense : 
Fragment Type: 
Original Source : 
Organism: 
Strain: 

Individual Isolate : 
Developmental State: 
Haplotype : 
Tissue Type : 
Cell Type: 
Cell Line: 
Organelle : 



5 (PAL) 

Luedeka, Neely & Graham 

P.O. Box 1871 

Knoxville, TN 37901 

Mark S. Graham

P.O. Box 1871 

Knoxville 

USA 

37901 

1.44 
DOS 
ASCII 
N/A 

Mark S. Graham
32,355
50617.00

(423) 546-4305 
(423) 523-4478 



1544 

DNA 

double 

linear 

cDNA 

No 

No 

N/A 

Pinus taeda (L.)

Wild Type 

N/A 

sporophyte 

N/A 

xylem 

parenchyma 

N/A 

N/A

AAAGATAATA TATGTGTATG CCTACTACTA CACATTGTTT TGAAGTGTGT AAACATAGTG 60

CAACACTAGG AGGACTCACA ATGAGCACTT GTTGACATGA AACTAGCTAA ATGCCCAACA 120 

ATATTAGTGA AAGCTAGTTA AACTAACCCC TTTGACTTTC AAGATGATAT ATTTATATCC 180 

CTACTACGTC TTCCTCTTTT TGTCTTTCTC TTGTGATTAA ACCTTCCTTG AAACAATTCT 240 

CAAATGTAAA ATTAAACCTT GAAACTTGTA GAGACCAAAC TTCCCTAGGA GAAACCACAT 300 

TTATGACAAC ATATATACAC CAACCCATTG CATACTATAA TATTGGAATT ACCTGCAGCG 360 

AACGAAAGAA ACGCTGTCTC ACCAACTCGT GCACTACATC CCGAAACTTA ACCTTCCCCT 420 

GATACAGATT GAAGAGCCGA AAAAAGCGTG CATCCAAATT TCTGGTATGG TGAGGAGCCG 480

AAAAACGCGT GCGCCTAATT TTTTTGAGAT GGGCCGGAAA ATAATGCGTG CATCTAAATT 540 

TTCACGTGTC GCGTATTGGC GAGGTTGCGC TGAATGTGAT CCTGTGCGTG AGCCACATTC 600 

ATTCCATTGG TTGACCCGCC GGTACCGCGA GGACCGTGGG GTCTCACAGA TACGCGGATG 660 

GTGGATCAGC ACTGAGAAGA TTAGATGATG ACCAGGCGGG CATTTGAAGT AAAAACTTGG 720 

GGGTGGTTGG CAAGTACGCG ACAAAGAGGG GTAGTGCGCA AGGAAGCGAG TTGGATGCAA 780 

ATAATATTAC AAAGTGGGTT GGTGGGCATG AGCATCAACC AGAATGATGT TGTTGCTGGT 840

TCCGTGCAAA TTCTGACCAG TAGTTTGAAC AATACTACCC AACTTGTTTT TGGTAAAACA 900 

TGAAGTGGGT AAGGAGAATT GAACTTACGT CTCATGGTAA AGGGCAAGGG CAAATGACTT 960 

AACACATACC TTTAACTAAT AAAAATACCC CTAACAAATA CGAAAACGAA TGAGTTATCA 1020 

CAGACCTTCA ACTAATAAGA TAGCCATCAG ACCCACATCT CCTGACTGAC CAAAAACAAA 1080 

TGACTTCAAC CAACTAAGAT ACCCATCAAA GCTAACCCAC AACCCAATTC CTCACTTCCC 1140

CTTACCAGAC CAACCAAGCA GACCTACGCC ATTAACTACT TTAGGACGTG GGAATTGGGG 1200

GTGCCACCGT TGAAGAATGG CACTCAGGGT TGGTAATCCC TCCACGTGTA TGTAGCAGTC 1260

GTTTGGTGGA GACGGCGTGT TTGAATGTCC ACCTTCCAGT TTGGAGAACA AGGAAATTGG 1320

GCTTATATTA GGCCTGGATC TCTTGTTTCA GAGCAGGAGT AGTTCAGGAC AGGAACTAGC 1380

ATTCAAGAAT TCAATTGCCC TGCCCTGCTC TGCTCTGCTT TGCTCAACTT ATTGATCCCT 1440

GCTCTGGTTT GTTCAATTTC TTGACCCCTG CTGGGTTCTG CTCTGGTTTG CACACTTTCT 1500

CGATTATATA AGTCATTTTG GATCCTTGCA AGGAAGAGAA TATG 1544

Information for Sequence ID #:
Correspondence Address : 



Addressee : 
Street : 
City: 
Country : 
Zip : 

Computer Readable Form: 
Medium Type: 
Operating System: 
Software : 

Current Application Data: 
Attorney Information: 
Name : 

Registration Number: 
Reference/Docket Number :
Telecommunication Information: 
Telephone : 
Telefax: 
Information for Seq ID No : 
Sequence Characteristics: 
Length : 
Type : 

Strandedness : 
Topology: 
Molecule Type: 
Hypothetical : 
Anti-Sense : 
Fragment Type : 
Original Source : 
Organism: 
Strain: 

Individual Isolate : 
Developmental State: 
Haplotype : 
Tissue Type: 
Cell Type: 
Cell Line: 
Organelle : 



6 (4CL1B) 

Luedeka, Neely & Graham 

P.O. Box 1871 

Knoxville, TN 37901 

Mark S. Graham

P.O. Box 1871 

Knoxville 

USA 

37901 

1.44
DOS 
ASCII 
N/A 

Mark S. Graham
32,355
50617.00 

(423) 546-4305 
(423) 523-4478

No

N/A

AAACACCAAT TTAATGGGAT TTCAGATTTG TATCCCATGC TATTGGCTAA GGCATTTTTC 60

TTATTGTAAT CTAACCAATT CTAATTTCCA CCCTGGTGTG AACTGACTGA CAAATGCGGT 120 

CCGAAAACAG CGAATGAAAT GTCTGGGTGA TCGGTCAAAC AAGCGGTGGG CGAGAGAGCG 180

CGGGTGTTGG CCTAGCCGGG ATGGGGGTAG GTAGACGGCG TATTACCGGC GAGTTGTCCG 240 

AATGGAGTTT TCGGGGTAGG TAGTAACGTA GACGTCAATG GAAAAAGTCA TAATCTCCGT 300

CAAAAATCCA ACCGCTCCTT CACATCGCAG AGTTGGTGGC CACGGGACCC TCCACCCACT 360 

CACTCAATCG ATCGCCTGCC GTGGTTGCCC ATTATTCAAC CATACGCCAC TTGACTCTTC 420

ACCAACAATT CCAGGCCGGC TTTCTATACA ATGTACTGCA CAGGAAAATC CAATATAAAA 480 

AGCCGGCCTC TGCTTCCTTC TCAGTAGCCC CCAGCTCATT CAATTCTTCC CACTGCAGGC 540 

TACATTTGTC AGACACGTTT TCCGCCATTT TTCGCCTGTT TCTGCGGAGA ATTTGATCAG 600

GTTCGGATTG GGATTGAATC AATTGAAAGG TTTTTATTTT CAGTATTTCG ATCGCCATG 659

GGCCGGGTGG TGACATTTAT TCATAAATTC ATCTCAAAAC AAGAAGGATT TACAAAAATA 60

AAAGAAAACA AAATTTTCAT CTTTAACATA ATTATAATTG TGTTCACAAA ATTCAAACTT 120

AAACCCTTAA TATAAAGAAT TTCTTTCAAC AATACACTTT AATCACAACT TCTTCAATCA 180 

CAACCTCCTC CAACAAAATT AAAATAGATT AATAAATAAA TAAACTTAAC TATTTAAAAA 240 

AAAATATTAT ACAAAATTTA TTAAAACTTC AAAATAAACA AACTTTTTAT ACAAAATTCA 300

TCAAAACTTT AAAATAAAGC TAAACACTGA AAATGTGAGT ACATTTAAAA GGACGCTGAT 360

CACAAAAATT TTGAAAACAT AAACAAACTT GAAACTCTAC CTTTTAAGAA TGAGTTTGTC 420 

GTCTCATTAA CTCATTAGTT TTATAGTTCG AATCCAATTA ACGTATCTTT TATTTTATGG 480 

AATAAGGGTG TTTTAATAAG TGATTTTGGG ATTTTTTTAG TAATTTATTT GTGATATGTT 540 

ATGGAGTTTT TAAAAATATA TATATATATA TATATTTTTG GGTTGAGTTT ACTTAAAATT 600 

TGGAAAAGGT TGGTAAGAAC TATAAATTGA GTTGTGAATG AGTGTTTTAT GGATTTTTTA 660

AGATGTTAAA TTTATATATG TAATTAAAAT TTTATTTTGA ATAACAAAAA TTATAATTGG 720 

ATAAAAAATT GTTTTGTTAA ATTTAGAGTA AAAATTTCAA AATCTAAAAT AATTAAACAC 780

TATTATTTTT AAAAAATTTG TTGGTAAATT TTATCTTATA TTTAAGTTAA AATTTAGAAA 840

AAATTAATTT TAAATTAATA AACTTTTGAA GTCAAATATT CCAAATATTT TCCAAAATAT 900 

TAAATCTATT TTGCATTCAA AATACAATTT AAATAATAAA ACTTCATGGA ATAGATTAAC 960 

CAATTTGTAT AAAAACCAAA AATCTCAAAT AAAATTTAAA TTACAAAACA TTATCAACAT 1020 

TATGATTTCA AGAAAGACAA TAACCAGTTT CCAATAAAAT AAAAAACCTC ATGGCCCGTA 1080 

ATTAAGATCT CATTAATTAA TTCTTATTTT TTAATTTTTT TACATAGAAA ATATCTTTAT 1140

ATTGTATCCA AGAAATATAG AATGTTCTCG TCCAGGGACT ATTAATCTCC AAACAAGTTT 1200

CAAAATCATT ACATTAAAGC TCATCATGTC ATTTGTGGAT TGGAAATTAT ATTGTATAAG 1260

AGAAATATAG AATGTTCTCG TCTAGGGACT ATTAATTTCC AAACAAATTT CAAAATCATT 1320

ACATTAAAGC TCATCATGTC ATTTGTGGAT TGGAAATTAG ACAAAAAAAA TCCCAAATAT 1380

TTCTCTCAAT CTCCCAAAAT ATAGTTCGAA CTCCATATTT TTGGAAATTG AGAATTTTTT 1440 

TACCCAATAA TATATTTTTT TATACATTTT AGAGATTTTC CAGACATATT TGCTCTGGGA 1500

TTTATTGGAA TGAAGGTTGA GTTATAAACT TTCAGTAATC CAAGTATCTT CGGTTTTTGA 1560 

AGATACTAAA TCCATTATAT AATAAAAACA CATTTTAAAC ACCAATTTAA TGGGATTTCA 1620

GATTTGTATC CCATGCTATT GGCTAAGGCA TTTTTCTTAT TGTAATCTAA CCAATTCTAA 1680

TTTCCACCCT GGTGTGAACT GACTGACAAA TGCGGTCCGA AAACAGCGAA TGAAATGTCT 1740

GGGTGATCGG TCAAACAAGC GGTGGGCGAG AGAGCGCGGG TGTTGGCCTA GCCGGGATGG 1800

GGGTAGGTAG ACGGCGTATT ACCGGCGAGT TGTCCGAATG GAGTTTTCGG GGTAGGTAGT 1860

AACGTAGACG TCAATGGAAA AAGTCATAAT CTCCGTCAAA AATCCAACCG CTCCTTCACA 1920

TCGCAGAGTT GGTGGCCACG GGACCCTCCA CCCACTCACT CGATCGCCTG CCGTGGTTGC 1980 

CCATTATTCA ACCATACGCC ACTTGACTCT TCACCAACAA TTCCAGGCCG GCTTTCTATA 2040 

CAATGTACTG CACAGGAAAA TCCAATATAA AAAGCCGGCC TCTGCTTCCT TCTCAGTAGC 2100 

CCCCAGCTCA TTCAATTCTT CCCACTGCAG GCTACATTTG TCAGACACGT TTTCCGCCAT 2160 

TTTTCGCCTG TTTCTGCGGA GAATTTGATC AGGTTCGGAT TGGGATTGAA TCAATTGAAA 2220 

GGTTTTTATT TTCAGTATTT CGATCGCCAT G 2251